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(57) Abstract 

A method of increasing the production of lovastatin or monacolin J in a lovastatin-producing or non-lovastatin-producing organism 
is disclosed. In one embodiment, the method comprises the steps of transforming an organism with the A. terreus D4B segment, wherein 
the segment is translated and where an increase in lovastatin production occurs. 
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METHOD OF PRODUCING ANTIHYPERCHOLESTEROLEMIC AGENTS 



CROSS-REFERENCES TO RELATED APPLICATION 

Not applicable. 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH AND DEVELOPMENT 

5 This invention was made with United States 

government support awarded by the following agencies: NIH 
Grant No: AI43031. The United States has certain rights 
in this invention. 

BACKGROUND OF THE INVENTION 

10 Cholesterol and other lipids are transported in body 

fluids by low-density lipoproteins (LDL) and high-density 
lipoproteins (HDL). Substances that effectuate 
mechanisms for lowering LDL- cholesterol may serve as 
effective ant ihypercholes t erolemic agents because LDL 

15 levels are positively correlated with the risk of 
coronary artery disease. 

MEVACOR (lovastatin; mevinolin) and ZOCOR 
(simvastatin) are members of a group of active 
ant ihypercholesterolemic agents that function by 

20 inhibiting the ra te - 1 imi t ing step in cellular cholesterol 
biosynthesis, namely the conversion of 

hydroxymethylglutarylcoenzyme A (HMG-CoA) into mevalonate 
by HMG-CoA reductase. 

The general biosynthetic pathway of a naturally 
25 occurring HMG-CoA reductase inhibitor has been outlined 
by Moore, et aJL . , who showed that the biosynthesis of 
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mevinc-lin ( lovastatin) by Aspergillus cerreus ATCC 20542 
begins with acetate and proceeds via a polyketide pathway 
(R.N. Moore, ec al . , Amer . Chem . Soc . 107:3694-3701, 

1985) . Endo, e_t al ■ described similar biosynthetic 
5 pathways in Pencillium citrinum NRRL 8082 and Monascus 

ruber M-4 681 ( A . Y . Endo, et al., Ant ibiot . 38:444-448, 

1985) . 

The recent commercial introduction of microbial HMG- 
CoA reductase inhibitors has fostered a need for high 
10 yielding production processes. Methods of improving 
process yield have included scaling up the process, 
improving the culture medium and simplifying the 
isolat ion . 

Previous attempts to increase the biosynthesis of 
15 HMG-CoA reductase inhibitors at the level of gene 

expression have focused on increasing the concentration 
triol polyketide synthase (TPKS) , a multifunctional 
protein with at least six activities as evidenced by the 
product of the enzymatic activity (Moore, supra , 1985) . 
20 TPKS is believed to be the rate-limiting enzymatic 
act ivi ty ( ies ) in the biosynthesis of the HMG-CoA 
reductase inhibitor compounds. 

U.S. patent 5,744,350 identifies a DNA encoding 
triol polyketide synthase (TPKS) from Aspergillus 
2 5 terreus. "NPKS" is now preferred to TPKS as the acronym 
for "nonaketide polyketide synthase." 



-2 - 
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. SUMMARY OF THE INVENTION 

In one embodiment, the present invention is a method 
of increasing the production of lovastatin in a 
lovastat in-producing organism. The method comprises the 
5 steps of transforming the organism with a nucleic acid 
sequence comprising the D4B segment, preferably- 
comprising nucleotides 579 - 33,000 of SEQ ID NO: 18 and 1 
- 5,349 of SEQ ID NO: 19.. The nucleic acid sequence is 
transcribed and translated and an increase in lovastatin 
10 production occurs. Preferably, this increase is at least 
2-fold. 

In a preferred form of the present invention, the 
lovastat in-producing organism is selected from the group 
consisting A. terreus ATCC 20542 and ATCC 20541. 

15 In another embodiment, the method comprises the step 

of transforming the organism with the corresponding D4B 
segment isolated from a non-A. terreus lovastat in- 
producing organism . 

In another embodiment, the present invention is a 

20 method of increasing the production of lovastatin in a 
lovastatin-producing organism, comprising the step of 
transforming the organism with the LovE gene, wherein the 
nucleic acid sequence is transcribed and translated and 
wherein an increase in lovastatin production occurs. 

25 In another embodiment of the present invention, one 

may increase the production of monacolin J in a non- 
lovastat in-producing organism comprising the steps of 
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transforming the organism .with a nucleic acid seouence 
comprising the D4B segment . As a further step, one may 
additionally transform the organism with an entire LovF 
gene. If the entire LovF gene is added to the D4B 

5 segment, the organism will produce lovastatin. 

In another embodiment , the present invention is the 
lovastat in product ion gene cluster , preferably SEQ ID 
NOs : 18 and 19 , and the individual genes comprising that 
cluster . 

10 It is an object of the present invention to provide 

a method for increasing lovastatin and monacol in J 
product ion in both lovastat in- producing and non- 
lovastatin producing organisms . 

Other ob j ects, features and advantages of the 

15 present invent ion wil 1 become apparent after review of 
the specification, claims and drawings. 

DESCRIPTION OF DRAWINGS 

Fig. 1 is a diagram of lovastatin production genes. 

Fig . 2 is a schematic diagram of a hypothetical 
2 0 mevinol in/ lovastat in biosynthesis pathway . 

Fig. 3 is a comparative diagram of statins. 

Fig . 4 is a schema t ic drawing of plasmid 
pWHM1264/CB24A. 

Fig. 5 is a schematic drawing of plasmid pWHM1424. 

25 Fig. 6 is a schematic drawing of plasmid 

CD4B/pWHM1263 . 
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DESCRIPTION OF THE INVENTION 

In General 

The Examples below disclose the cloning and 

sequencing of a cluster of 17 genes from A. terreus ATCC 
5 20542, a strain that natively produces lovastatin (See 

Fig. 1) . These genes flank the NPKS gene, which is known 

to be required for lovastatin production (see, for 

example, U.S. patent 5,744,350). 

The DNA sequence of the cluster has been determined 
10 and is disclosed below at SEQ ID NOs:18 and 19. 

Mutations in four of the genes (P450I/LOVA, SEQ ID NO:22; 

dehydrogenase/LovC, SEQ ID NO: 24; esterase/LovD, SEQ ID ; 

NO: 2 5; and ScPKS/LovF, SEQ ID NO: 29) have been isolated 

and demonstrate that each of these four individual genes 
15 is required for lovastatin production. These genes are 

indicated with an X symbol in Fig. 1 and referred to 

herein as the U A. terreus lovastatin gene cluster." 
Another of the genes (Zn Finger I/Lov£, SEQ ID 

NO: 27) is thought to regulate the transcription of the 
20 other genes and causes a notable increase in lovastatin 

production when reintroduced into A . terreus ATCC 20542. 
Applicants have used the following convention in 

naming the genes and proteins of the present invention. 

The genes and proteins are first named with either an 
25 u ORF" or "Lov" prefix and then named either numerically 

or alphabetically. "Lov" signifies genes shown to be 

essential for lovastatin production. Applicants have 

-5- 
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also included a descriptor name that describes a probable 
function of the protein. For example, SEQ ID NO : 1 is 
described as the "ORFl/esterase- like protein" because 
Applicants have compared the amino acid sequence to known 
5 esterases. 

The portion of the gene cluster between 
ORFl/esterase-like protein and the mid-region of 
LovF/SCPKS is referred to as the "D4B segment" . The A. 
terreus D4B segment is contained within a plasmid clone 

10 deposited as ATCC 98876. As described below, other 

lovastat in-producing organisms contain an analogous D4B 
segment comprising analogous genes. The present 
invention comprises a "D4B segment" isolated from other 
lovastatin-producing organisms. The arrangment of the 

15 genes within the D4B segment may be different in other 

organisms. We predict that the genes within these other 
segments will have at least 80% homology, at the nucleic 
acid level, with the genes disclosed herein. We envision 
that each of these lovastat in-producing organisms will 

2 0 comprise within their genomes a LovA, LovB, LovC, LovD, 
LovE and LovF gene . 

We have determined that the D4B segment will confer 
production of monocolin J if the genes are all expressed, 
as we show below in an example using A . nidulans . We 

25 envision that adding the LovF gene to the D4B segment 
genes will result in the production of lovastatin in a 
non- lovastat in -producing organism. 
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Table 1, below, summarizes information regarding che 
different protein and nucleic acid sequences of the 
present invention. SEQ ID NOs:l-17 are predicted 
translation products of various members of the gene 
cluster. SEQ ID NOs:18 and 19 are the entire DNA 
sequence of the gene cluster. SEQ ID NOs:21-36 are the 
genomic DNA sequences of the various members of the gene 
cluster and include the introns . These DNA sequences are 
reported in the Sequence Listing in the 5' - 3' 
orientation, although, as Fig. 1 indicates, some of these 
DNA sequences are in the inverted orientation in the 
actual cluster. 



TABLE 1 



SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 1 


Predicted amino acid sequence 
of ORFl/Esterase-Iike protein 


Translation of 6 EXONS 6865- 
6568, 6462-5584, 5520-4822, 4774- 
3511,3332-2372, 2301-1813 
(reverse complement) 
FROM SEQ ID NO: 18 


SEQ ID NO: 2 


Predicted amino acid sequence 
ofORF2 


Translation of 1 EXON 7616-8602 
FROM SEQ ID NO: 18 


SEQ ID NO: 3 


Predicted amino acid sequence 
ofLovA/P4501 protein 


Translation of 1 EXON 10951-9980 
(reverse complement) 
FROM SEQ ID NO: 18 


SEQ ID NO: 4 


Predicted amino acid sequence 
ofORF5 


Translation of 1 EXON 22760- 
21990 (reverse complement) 
FROM SEQ ID NO: 18 


SEQ ID NO: 5 


Predicted amino acid sequence 
of LovC/Dehydrogenase 


Translation of 3 EXONS 23158- 
23717. 23801-23912, 23991-24410 
FROM SEQ ID NO: 18 


SEQ ID NO: 6 


Predicted amino acid sequence 
of LovD/Esterase 


Translation of 3 EXONS 26203- 
26080, 26005-25017, 24938-24810 
(reverse complement) 
FROM SEQ ID NO: 18 
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SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 7 


Predicted amino acid sequence 
of ORF8/HMG Co A Reductase 


Translation of 5 EXONS 3006?- 

29882, 29803-29745, 29664-^71 19 

27035-26779, 26722-26559 (reverse 

complement) 

FROM SEQ ID NO: 18 


SEQ ID NO: 8 


Predicted amino acid sequence 
of LovE/Zn Finger I 


Translation of 1 EXON 31360- 
32871 

FROM SEQ ID NO: 18 


SEQ ID NO: 9 


Predicted amino acid sequence 
of ORFIO/Metabolite transport 


Translation of 8 EXONS 1400- 
1452, 1619-1695. 1770-1996. 2065- 
2088, 2154-2225, 2332-2865. 2939- 
3099, 3180-3560 
FROM SEQ ID NO: 19 


SEQ ID NO: 10 


Predicted amino acid sequence 
ofLovF/ScPKS 


Translation of 7 EXONS 4430- 
4627, 4709-4795, 4870-4927. 4985- 
5318, 5405-5912, 5986-6565. 6631- 
12464 

FROM SEQ ID NO. 19 


SEQ ID NO: 1 1 


Predicted amino acid sequence 
of ORF12 


Translation of 3 EXONS 13596- 
13496, 13451-13063, 12968-12709 
(reverse complement) 
FROM SEQ ID NO: 19 


SEQ ID NO: 12 


Predicted amino acid sequence 
ofORF13/Zn Finger II 


Translation of 5 EXONS 16608- 

16463, 16376-15572, 15519-15346, 

15291-14825, 14767-14131 (reverse 

complement) 

FROM SEQ ID NO: 19 


SEQ ID NO: 13 


Predicted amino acid sequence 
ofORF 14/ Acetyl CoA 
transport protein 


Translation of 7 EXONS 19642- 

19571, 19502-19427, 19352-19227, 

19158-19011, 18956-18663. 18587- 

18438, 18380-18341 (reverse 

complement) 

FROM SEQ ID NO: 19 


SEQ ID NO: 14 


Predicted amino acid sequence 
ofORF15 


Translation of 2 EXONS 20332- 
20574, 20631-21860 
FROM SEQ ID NO: 19 


SEQ ID NO: 15 


Predicted amino acid sequence 
of ORF 16/Membrane transport 
protein 


Translation of 5 EXONS 24521- 

24054, 23996-23936, 23876-23184, 

23 1 1 1 -22977, 22924-228 1 8 (reverse 

complement) 

FROM SEQ ID NO: 19 


SEQ ID NO: 16 


Predicted amino acid sequence 
of ORF17/P450II protein 


Translation of 3 EXONS 28525- 
27673, 27606-27284, 2721 1-26837 
(reverse complement) 
FROM SEQ ID NO: 19 
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SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 17 


Predicted amino acid sequence 
of ORE 18 (incomplete) 


Translation of 2 EXONS 29826- 
30995. 31054-31328 (incomplete) 
FROM SEQ ID NO: 19 


SEQ ID NO: 18 


DNA sequence of gene cluster- 
first 33,000 nucleotides 




bbv ID NvJ. 1 y 


DNA sequence of cluster- 
nucleotides 33,001-64,328 
(renumbered 1-31,328) 




bhQ ID NU. zU 


DNA sequence of 
ORFl/Esterase-like gene 


otan — ooOj 
Stop =1813 
SEQ ID NO: 18 


SEQ ID NO: 21 


DNA sequence oi ORF2 


Start = 7616 
Stop = 8602 
SEQ ID NO: 18 


SEQ ID NO: 22 


DNA sequence oi LovA/P450I 
gene 


Start — 1095 1 
Stop = 9980 
SEQ ID NO: 18 


SEQ ID NO: 23 


DNA sequence ot UKr_> 


s>tart — 12. /oU 
Stop = 21990 
SEQ ID NO: 1 8 


SEQ ID NO: 24 


DNA sequence of 
LovC/Dehydrogenese 


Start = 2315a 
Stop = 24410 
SEQ ID NO: 18 


SEQ ID NO: 25 


DNA sequence of 
LovD/Esterase 


Start = 248 1 0 
Stop = 26203 
SEQ ID NO: 18 


SEQ ID NO: 26 


DNA sequence of ORro/HMCi 
CoA Reductase 


btart = JOUoz 
Stop = 26559 
SEQ ID NO: 18 


SEQ ID NO: 27 


DNA sequence of LovE/Zn 
Finger I 


Mart — 3 1 360 
Stop = 32871 
SEQ ID NO: 18 


ShQ ID NO: 2o 


DNA sequence of 
ORFIO/Metabolite transport 


O * „ 1 Af\C\ 

Start — 1 4U0 
Stop = 3560 
SEQ ID NO: 19 




ltn/\ sequence oi Lovr/ocrrto 


Ctar-t = AA'XO 

oiarr — *-¥Hj\j 
Stop = 12464 
SEQ ID NO: 19 


SEQ ID NO: 30 


DNA sequence of ORF12 


Start = 13596 
Stop= 12709 
SEQ ID NO: 19 
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SEQ ID NO. 


DESCRIPTION 


COMMENTS 


SEQ ID NO: 31 


DNA sequence of ORF13/Zn 
Finger II 


Start = 16608 
Stop= 14131 
SEQ ID NO:19 


SEQ ID NO: 32 


DNA sequence of 

ORF 1 4/Acetyl CoA transport 

gene 


Start = 1 9642 
Stop= 18341 
SEQ ID NO: 19 


SEQ ID NO: 33 


DNA sequence of ORF 15 


Start = 20332 
Stop = 21860 
SEQ ID NO: 19 


SEQ ID NO: 34 


DNA sequence of 

ORF 1 6/Membrane transport 

protein 


Start = 24521 
Stop = 22818 
SEQ ID NO: 19 


SEQ ID NO: 35 


DNA sequence of 
ORF17/P450II gene 


Start = 28525 
Stop = 26837 
SEQ ID NO: 19 


SEQ ID NO: 36 


DNA sequence of ORF 18 
(incomplete) 


Start = 29826 

to 31328 (incomplete) 

SEQ ID NO: 19 



Table 1 also notes the translation start and stop 
points in the various gene sequences. 

10 The sequence of the NPKS gene is not listed in SEQ 

ID NOs:21-36. This gene is characterized in U.S. patent 
5,744,350. However, SEQ ID NOs:18 and 19 do contain the 
sequence of the NPKS gene within the context of the 
entire gene cluster. 

15 To perform many embodiments of the present 

invention, one will need to recreate various genes or a 
portion of the gene cluster described herein. Applicants 
have provided sequence data in the Sequence Listing 
sufficient to allow one of skill in the art to construct 

20 numerous probes suitable to recreate the genes from an A. 
terreus genomic library. Applicants have also described 
below various methods for isolating A. terreus DNA. 

-10- 
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Additionally, Applicants have deposited ATCC 
Accession No. ATCC 98876, which contains clone pWHM1263 
(cD4B) and ATCC Accession No. ATCC 98877 which contains 
clone pWHM1265 (CB2A4) . Both plasmids are described in 
5 more detail below. Fig. 4 describes clone 

CB2A4/pWHM1265 , and Fig. 6 describes clone CB4B/pWHM1263 . 
Fig. 1 also indicates the boundaries of the D4B and B2A4 
clones . 

The clones and their inserts may be prepared from 
10 the ATCC deposits by methods known to those of skill in 

the art . The DNA from the clones may be isolated and any 
gene within the gene cluster may be isolated and 
utilized. 

Increasing the Production of Lovastatin by Lovastatin- 
15 producing Fungi and Yeast 

In one embodiment, the present invention is a method 
of increasing the production of lovastatin in a 
lovastat in-producing fungi and yeast, preferably A. 
terreus ATCC20542 and ATCC20541. Other examples of 
20 suitable lovastatin-producing fungi and yeast are listed 
in Table 2, below. 
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Microorganisms other than A. terreus reported to produce lovastatin Imevinolin) 

Monascus (17 of 124 strains screened) species' 

M. ruber 
M. purpureas 
M. pilosus 
M. vitreus 
M. pubigerus 

Peniciliium sp. 1 
Hypomyces sp. 
Doratomyces sp. 
Phoma sp. 
Eupeniciliium sp. 
Gymnoascus sp. 
Trichoderma sp. 

Pichia labacensis 2 
Candida cariosilognicola 

Aspergilus oryzea 3 
Doratomyces stemonitis 
Paecilomyces virioti 
Peniciliium citrinum 
Penicillin chrysogenum 
Scopuiariopsis brevicauiis 
Trichoderma viride 



1. P. Juzlova, L. Martinkova, V. Kren. Secondary Metabolites of the fungus Monascus: a review. 
J. ind. Microbiol. 1 6:163-1 70 and references cited therein ( 1 996). 

2. N. Gunde-Cimerman, A. Plemenitas and A. Cimerman. A hydroxymethyigiutaryl-CoA reductase 
inhibitor synthesized by yeasts. FEMS Microbiol. Lett. 132:39-43 (1995). 

3. A.A. Shindia. Mevinolin production by some fungi. Folio Microbiol. 42:477-480 (1997). 



By "increasing the production" we mean that the 
amount of lovastatin produced is increased by at least 2- 
fold, preferably by at least 5-fold. The examples below 
35 demonstrate two preferred methods for analyzing strains 
for lovastatin production. In method A, the spore 
suspension is inoculated into a flask of SEED medium and 
grown. The resulting seed culture is used to inoculate 
FM media and grown for six days. In fermentation method 
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B , one inoculates 50 ml of RPM media and grows this 
larger culture for 7 days. 

Both cultures are extracted, pH adjusted, mixed with 
ethyl acetate and shaken for two hours. For analysis, 1 
5 ml of the ethyl acetate layer is dried under a nitrogen 
stream and resuspended in methanol. For TLC analysis, a 
small amount of the extract is run on CI 8 reverse phase 
TLC plates in a solvent system of methanol; 0.1% 
phosphoric acid. The TLC plates are developed by 

10 spraying with phosphomolybdic acid in methanol and 

heating with a heat gun. The extracts are compared with 
authentic lovastatin, monacolin J, monacolin L and 
dihydromonocolon L. 

If one wishes HPLC analysis, the examples below 

15 describe the use of a Waters Nova-Pak C18 column used 
with a solvent system of acetonitrile and phosphoric 
acid. A Waters 996 Photodiode Array Detector will detect 
the metabolites. Lovastatin was detected at 238 nm. 

In one embodiment, one would transform a lovastatin- 

20 producing fungi or yeast with the lovE/zinc finger I 
gene, preferably comprising the nucleotides of SEQ ID 
NO: 27. The examples below predict that this will result 
in an increase of at least 5-7 fold. Preferably, the 
increase will be at least 2.0-fold. 

25 One may also transform a lovastat in-producing fungi 

or yeast with the LovE gene isolated from other 
lovastat in-producing fungi or yeast. One may obtain this 
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gene by use of a probe derived from SEQ ID NO: 27 by 
methods known to those of skill in the art. 

One may also transform lovastat in-producing fungi 
and yeast with the D4B segment of the lovastatin 
5 production gene cluster (see Fig. 1), preferably as found 
in ATCC accession number 98876. Alternatively, one may 
transform lovastat in-producing fungi or yeast with the 
entire gene cluster, as diagramed in Fig. 1. 

We envision that to successfully increase lovastatin 

10 production, one may also wish to transform less than the 
entire gene cluster. Preferably, one may determine what 
the smallest possible segment is by deleting various 
portions of the gene cluster and determining whether 
lovastatin production is continually increased. 

15 Similarly, if one begins with the D4B segment, one may 
delete various portions for the segment and determine 
whether lovastatin production is continually increased by 
at least 2-fold. 

Modification of the LovB/NPKS gene would produce 

20 other HMG CoA inhibitors. For example, Fig. 3 diagrams 
the relationship between mevastatin, lovastatin, 
simvastatin and pravastatin. In one example, the methyl 
transferase domain of the NPKS gene may be replaced with 
an inactive form to make pravastatin. The HMG -CoA 

25 reductase inhibitors within this invention include, but 
are not limited to, compactin (ML-236B) , lovastatin, 
simvastatin, pravastatin and mevastatin. 
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In another embodiment of the present invention, one 
may transform a lovastat in-producing organism with the 
genes described above and obtain the production of an HMG 
CoA reductase inhibitor with a structure different from 
5 monacolin J, monacolin L or lovastatin. Alterations in 
the side chain attached to C8 are the most likely- 
possibility but other alterations may occur. These 
alterations would happen through the native biochemistry 
of the organism. 

10 If one wishes to express the A . terreus genes in 

yeast, one may wish to consult examples in which others 
have engineered fungal secondary metabolism genes for 
expression in yeast. (See for example, J. T. t Kealey, £t 
al . , Proc . Natl . Acad. Sci . USA 95:505-509 (1998) ) . The 

15 exact approach could be used with the NPKS (LovB) and 

ScpKS (LovF) genes, and a somewhat simpler approach with 
the other lovastatin genes in their cDNA forms. 

Production of HMG- CoA Reductase Inhibitors by Fungi and 
Yeast That Do Not Natively Produce Inhibitors . 

20 In another embodiment, the present invention is the 

production of HMG - CoA reductase inhibitors, such as 
lovastatin, by fungi and yeast that do not natively 
produce lovastatin. An example of a suitable fungi or 
yeast is A. nidulans and S. cerevisiae, respectively. 

25 For this embodiment one preferably transforms the 

genes within the D4B segment into the non- inhibitor- 
producing strain. By this method, one would produce 
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monacolin J (See Fig. 2) which could be chemically 
converted to lovastatin by one of skill in the art. 

Monacolin J, in its lactone form obtained by 
treatment with anhydrous acid under dehydrative 
5 conditions, is preferably treated with a derivative of 

(2S) -2 -methybutyric acid, in which the carboxyl group has 
been suitable activated for undergoing esterif ication, 
and the resulting lovastatin is isolated by conventional 
methods. For example, see WO 33538, U.S. patent 

10 4,444,784 and Med . Chem . '29:849 (1986) . These are 

citations for synthesis of simvastatin from monacolin J. 
One would use the same method, but use the (2S) -2- 
methylbutyrate derivative to make lovastatin. 

In another embodiment of the present invention, one 

15 would transform the genes within the D4B segment, 
including an entire LovF/SCPKS gene, into the non- 
inhibitor-producing organism. By this method, one would 
produce lovastatin in a non- lovastat in-producing 
organism . 

20 In another embodiment of the present invention, one 

may transform a non- lovastat in-producing organism with 
the genes described above and obtain the production of an 
HMG CoA reductase inhibitor with a structure different 
from monacolin J, monacolin L or lovastatin, as described 

2 5 above. 

Modification of the LovB/NPKS gene would produce 
other inhibitors. For example, Fig. 3 diagrams the 
relationship between mevastatin, lovastatin, simvastatin 
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and pravastatin. In one example, the methyl transferase 

domain of the NPKS gene may be replaced with an inactive 

form to make pravastatin. The HMG-CoA reductase 

inhibitors within this invention include, but are not 

5 limited to, compactin (ML-236B) , lovastatin, simvastatin, 

pravastatin and mevastatin. 

Production of Intermediate Materials 

In another embodiment, the' present invention is a 
method of isolating intermediate materials in the 

10 production of lovastatin and analogs such as mevastatin 
and simvastatin. For example, the Examples below 
demonstrate the disruption of the lovastatin projection 
gene cluster with mutagenized LovC, LovD, LovF, LovA or 
LovB genes. Disruption of many of these genetic elements 

15 of the lovastatin production gene cluster will result in 
accumulation of intermediate materials. Therefore, to 
practice this embodiment of the present invention, one 
would transform a suitable lovastat in-producing host with 
a mutagenized gene within the D4B segment, as described 

2 0 below. 

Many other mutations would be suitable to destroy 
the function of LovC, LovD, LovF, LovA or LovB. All that 
is necessary is these genes be disrupted to the extent 
that they are non- functional . 



2 5 Production of Lovastatin Analogs 

In another embodiment, the present invention 
provides a method for engineering the production of 
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lovastacin analoqs in crm~h Avaanicm- - ^ 

-° xn sucn organisms as fungi or yeast 

using monacolin J as the starting point. 

Isolated DNA Segment- g 

In another embodiment, the present invention is a 
5 DNA segment capable of conferring lovastatin or monacolin 
J production or increase in lovastatin or monacolin J 
production in yeast or fungi. in a preferred example, 
this segment is the "D4B segment" that is deposited at 
ATCC 98876. The nucleotide sequence of this segment is 
found in residues 579 - 33,000 of SEQ ID NO: 18 and 
residues 1 - 5,349 of SEQ ID NO: 19. 

In another embodiment, the present invention is the 
entire A. Cerreus lovastatin gene cluster, as • exemplified 
by SEQ ID NOs:18 and 19 and ATCC deposits 98876 and 
15 98877. 

The present invention is also the individual genes 
that make up the A . terreus lovastatin gene cluster. 
Therefore, the present invention is a nucleic acid 
segment selected from the group of consisting of SEQ ID 
NOs:20 - 36. Preferably, the present invention is the 
coding region found within SEQ ID NOs:20 - 36 and 
described in Table 1. The present invention is also a 
mutagenized version of SEQ ID NOs:22, 24, 25 and 29, 
wherein the gene is mutagenized to be non- functional in 
terms of lovastatin or monacolin J production. 



20 



25 
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Organisms with Tnrr eased Lovastatin or Monacolin J 
Production 

In another embodiment ( the present invention are the 
organisms described above. These organisms include 
5 lovastatin-producing organisms, preferably yeast and 

fungi, that have been engineered to display at least a 2- 
fold increase in lovastatin or monacolin J production. 
The organisms also include non- lovastat in -producing 
organisms, preferably yeast or fungi, that have been 
10 engineered to produce monacolin J or lovastatin. 

Antifungal Compounds 

Applicants note that lovastatin, monocolin J , 
monocolin L and dihydromonocol in L all have varying 
degrees of antifungal activity. Applicants envision that 

15 the present invention is also useful for providing 

antifungal compounds and organisms engineered to express 
antifungal compounds. Preferably, one would measure the 
antifungal properties of a compound in the manner of N. 
Lomovskaya, et al . , Microbiology 143:875-883, 1997. 

20 Measurement of inhibition of yeast growth can be found in 
R. Ikeura, et al . , Antibiotics 41:1148, 1988. The 

same general methods could be used for all fungi. Both 
of these references are hereby incorporated by reference. 
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EXAMPLES 

1 . General Methods and Procedures 

Construction of an A. terreus ATCC20542 genomic library. 

A, terreus ATCC20542 genomic DNA was partially 
5 digested with Sau3AI so as to produce an average fragment 
size of 40 - 50 kb . The partially digested genomic DNA 
was then separated on a sucrose gradient and the 40 - 50 
kb fraction was collected. Cosmid AN26,. {Taylor and 
Borgmann, Fungal Genet . Newsletter 43, 1996) was prepared 

10 by digestion with Clal , dephosphorylated with CIP, then 
digested with BasnHI to create the two cosmid arms. 
Ligation reactions with genomic DNA fragments and cosmid 
arms were optimized and packaged using Gigapack III XL 
packaging extract (Stratagene) . The packaged cosmid 

15 library was infected into E . coli JM109 and plated out 
onto LB agar (Sambrook, et al . , Molecular Cloning. A 
Laboratory Manual . 2nd ed . Cold Spring Harbour 
Laboratory Press, 1989; other standard methods used can 
be found here also) with ampicillin (50 jig/ml) plates. 

20 After checking for the presence of insert DNA in a 

selection of clones, 5000 colonies were picked into LB 
plus 50 jzg/ml ampicillin filled microtitre plates and 
grown overnight at 37 C C. The colonies were replica 
plated onto nylon membranes (Amersham Hybond-N) . 

25 Glycerol was added at a final concentration of 15% (v/v) 
to the microtitre plates and these were stored at -70°C. 
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Isolation of genomic clones containing the lovastatin 
biosynthesis cluster . 

A 2.3 kb EcoRI fragment from pTPKSlOO containing 

.part of the NPKS gene (Vinci, et al., U.S. Patent No. 

5 5,744,350) was gel - i solated and labelled with digoxigenin 

using the Genius Kit II (Boehringer Mannheim) . This 

labelled fragment was hybridized (65°C, 5x SSC) with the 

nylon membranes containing the A. terreus genomic 

library, then washed (65°C, 0 . lx SSC). Two positive 

10 clones were identified, pWHM1263 (cD4B) and pWHM1264 
(cJ3A) . Two of these clones, pWHM1263 (cD4B) and 
pWHM1265 (cB2A4), have been deposited in the ATCC 
(American Type Culture Collection, 10801 University 
Boulevard, Menassas , VA 20110) at accession number ATCC 

15 98876 and 98877, respectively, under the terms and 

conditions of the Budapest Treaty. The presence of the 
NPKS gene was confirmed initially by restriction 
digestion and later by DNA sequencing. 

Overlapping clones were found by repeating the 

20 hybridization process using labelled fragments from both 
ends of the insert in pWHM1263 . This resulted in the 
isolation of pWHM1265- 1270 (cB2A4, cL3E2 , cJ3B5, C02B5, 
cR3B2 , CW3B1) from downstream of the NPKS gene and 
pWHM1271 (cQIFl) from upstream of NPKS. All these clones 

25 were transformed into E. coli strain STBL2 (Stratagene) 
to help prevent rearrangements . 

Fig. 4 is a diagram of the cB2A4 /pWHM12 65 clone. 
This clone contains an insert of approximately 43 kb in 
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AN26 and includes the nucleotide sequence from at least 
nucleotides 4988 of SEQ ID NO: 19 to nucleotide 31,328 of 
SEQ ID NO: 19 and 10 - 14 kb of uncharacteri zed DNA . Fig. 
6 is a schematic diagram of cD4B/pWHM1263 . This clone 
5 contains a 37,770 bp insert in AN26 and contains 

nucleotides 579 - 33,000 of SEQ ID NO: 18 and nucleotides 
1 - 5,349 of SEQ ID NO:19. 
Sequencing strategy and analysis. 

A series of overlapping subclones (pWHM1272- 

10 pWHM1415) were constructed in pSPORTl (Gibco-BRL) and 
pGEM3 (Promega) . Plasmid DNAs for sequencing were 
prepared using the QiaPrep spin miniprep kit (Qiagen) . 
Cycle sequencing was carried out using the AmpliTaq FS or 
BigDye reagents (ABI) and were analyzed using a ABI model 

15 373 or 377 DNA Sequencer. Primer walking was performed 

by synthesis of 18-22 bp oligonucleotide primers based on 
the sequenced DNA strand, with the help of the Oligo 4.05 
program (National Biosciences, Inc.). Every region of 
DNA was sequenced at least once on both strands. Direct 

20 sequencing of cosmids and PCR products was used to 

confirm adjoining regions where no overlapping clones 
existed. DNA sequence analysis and manipulations were 
performed using SeqMan ( DNAS TAR) and SeqEd (ABI) 
software. Assignments of putative ORFS , including 

25 putative introns, were performed with the aid of BLAST 

2.0 searches (Atschul, et al . , Nucl . Acids Res . 25:3389- 
3402, 1997), and the Genetics Computer Group (GCG) 
programs (Program Manual for the Wisconsin Package, 
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Version 8, September 1994, Genetics Computer Group, 

•Madison, WI ) , version 8.1. 

Isolation and characterization of lovF (ScPKS, ORF11) , 
lovD (EST1, ORF7), lovC (DH, ORF6) , and lovA (P450I, 
5 ORF3) mutants. 

lovF 

To disrupt the polyketide synthase gene, lovF, a 1.7 
kb EcoRI fragment internal to the lovF gene was subcloned 
from pWHM1265 into pSPORTl to give pWHM1291 . The ScPKS 

10 fragment was then subcloned from this vector, as an 

Acc65I - Hindi I I fragment, into pPLOA (Vinci, et al . , 
U.S. Patent No. 5,744,350) to give pWHM1416. This vector 
contains the phleomycin (Zeocin, obtained from 
InVitrogen) resistance gene for selection in A* terreus . 

15 A . terreus ATCC20542 was then transformed to Zeocin 
resistance with this plasmid as described below. 
Transf ormants were screened for lovastatin production as 
described below (Method A) . In one of the transf ormants , 
WMH173 1, lovastatin production was abolished and a new 

2 0 compound accumulated. This new compound comigrated with 
monacolin J on TLC and HPLC according to the methods 
described below. Semi -preparative HPLC was used to 
partially purify the major product which was then 
analyzed by HPLC - MS. The same mass and fragmentation 

25 pattern as authentic monacolin J was observed. To 

confirm the disruption of the lovF gene, total genomic 
DNA was prepared from wild- type A. terreus ATCC20542 and 
the WMH1731 mutant strain. The genomic DNA was digested 
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with BasriRI and Hindlll, elect rophoresed on an agarose gel 

and capillary blotted onto a nylon membrane. The 
membrane was hybridized with the 1.7 kb EcoRI fragment 
from pWHM1416 labelled using the Genius II kit 
5 (Boehringer Mannheim) using the conditions described 

previously. The wild-type strain had hybridizing bands 
at 4.2 kb for BamHI and 11.5 kb for Hindlll. As 
predicted, the WMH1731 mutant strain had hybridizing 
bands at 6.5 kb and 2.2 kb for BawHI and 11 kb and 7.8 kb 
10 for Hindi I I confirming the homologous integration of a 
single copy of pWHM1416 at the lovF locus. 
lovD 

To disrupt the putative esterase/carboxypept idase - 
like gene, lovD, a 4.8 kb NotI - EcoRI fragment from 

15 pWHM1263 was subcloned into pSPORTl to give pWHM1274 . 

This plasmid was digested with Hindi I I and SsiWI and a 
1.8 kb fragment was isolated. The plasmid was also 
digested with Hindi I I and SamHI and the 6.6 kb fragment 
was isolated. pPLOA was digested with BajnHI and Acc65I 

20 and the 2.1 kb fragment containing the phleomycin 

resistance marker was purified. These three fragments 
were ligated together and used to transform competent E . 
coli cells. The expected plasmid, pWHM1417, containing 
the phleomycin resistance gene flanked by the beginning 

25 and the end of the lovD gene was isolated. This plasmid 
was linearized by digestion with Xbal or RsrII before 
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being used to transform A. terreus ATCC20542 to Zeocin 

resistance. Transf ormant s were screened for lovastatin 

production as described below (Method A) . In one of the 

transf ormant s , WMH1732, lovastatin production was 

5 abolished and a new compound accumulated. This new 

compound comigrated with monacolin J on TLC and HPLC 

according to the methods described below. Semi- 

preparative HPLC was used to partially purify the major 

product which was then analyzed by HPLC - MS. The same 

10 mass and fragmentation pattern as authentic monacolin J 
was observed. To confirm the disruption of the lovD 
gene, total genomic DNfA was prepared from wild type A. 
terreus ATCC20542 and the WMH1732 mutant strain. The 
genomic DNA was digested with Apal , run out on an agarose 

15 gel and capillary blotted onto a nylon membrane. The 
membrane was hybridized with the 4.8 kb Wot I - EcoRI 
fragment from pWHM1274 labelled using the Genius II kit 
using the conditions described previously. The wild- type 
strain had hybridizing bands at 9 kb, 8.4 kb and 1.5 kb . 

20 As predicted the mutant strain had hybridizing bands at 9 
kb, 8 kb, 3 kb and 1.5 kb confirming the homologous 
integration of a single copy of pWHM1417 at the lovD 
locus . 
lovA 

25 To disrupt the cytochrome P450 I gene, 2ovA, an 11 

kb Acc65I - EcoRI fragment from pWHM1263 was subcloned 
into pGEM3 to give pWHM12 72 . From this plasmid a 2.1 kb 
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Apal - SnaBI fragment was purified and ligated to Apal - 

EcoRV digested pPLOA to give p450Phleo (pWHM1418) . From 

this plasmid a 4.2 kb Apal - No tl fragment was purified 

and ligated with a 1.8 kb EagI - Kpnl fragment from 

5 pWHM1272 and Apal - Kpnl digested pGEM7 to. give 

p450Dphleo (pWHM1419) which contains the lovA gene 

disrupted by the phleomycin resistance gene . This 

plasmid was then digested with Kpnl and Apal and the 

resulting fragment was used to transform A. terreus 

10 ATCC20542 to Zeocin resistance. Transf ormant s were 

screened for lovastatin production as described below 
(Method A). In one of the transf ormants , WMH1733, 
lovastatin production was abolished and two new compounds 
accumulated. Genomic DNA was prepared from this strain 

15 and from A. terreus ATCC20542, digested with EagI, run 

out on an agarose gel, and ' capi 1 lary blotted onto a nylon 
membrane. The membrane was hybridized with the 6 kb Apal 
- Kpnl fragment from pWHM14 19 labelled using the Genius 
II kit using the conditions described previously. The 

20 wild-type strain had hybridizing bands at 2.0 kb, 1.9 kb 
and 1.1 kb. Mutant strain WMH1733 had hybridizing bands 
at 2.5 kb, 2.0 kb, 1.1 kb and 0.7 kb confirming the 
homologous integration of a single copy of the fragment 
from pWHM14 19 at the lovA locus. 
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lovC 

To disrupt the dehydrogenase - 1 ike gene, lovC, a 2 kb 
EcoRI - Bglll fragment from pTPKSlOO was ligated with a 
1 . 7 kb EcoRI - Sad fragment from pWHM1274 and Bglll - 
SacI digested litmus 28 (New England Biolabs) to produce 
pDHl (pWHM1420) . Another plasmid pDH2 (pWHM1421) was 
constructed from a 2.2 kb AccGSI - SacI fragment from 
pWHM1274, a 2 . 1 kb Hindi I I - SacI fragment from pPLOA 
containing the phleomycin resistance gene and Hindi I I - 
Acc65I digested litmus 28. The disruption vector pDH-dis 
(pWHM1422) was constructed by ligating together a 2.5 kb 
Bglll - Hpal fragment from pWHM1420, a 4 . 3 kb EcoRV - 
Kpnl fragment from pWHM1421 and Bglll - Kpnl digested 
litmus 28. This plasmid was digested with Bglll and Kpnl 
and the resulting 6.8 kb fragment was used to transform 
A. terreus ATCC20542 to Zeocin resistance. Transf ormants 
were screened for lovastatin production as described 
below (Method A) . In two of the transf ormants , WMH1734 
and WMH1735, lovastatin production was abolished. 
Genomic DNA was prepared from these strains and from A. 
terreus ATCC20542, digested with EagI , run out on an 
agarose gel, and capillary blotted onto a nylon membrane. 
The membrane was hybridized with the 6.8 kb Bgrl II- Kpnl 
fragment from pWHM1422 labelled using the Genius II kit 
using the conditions described previously. The wild type 
strain had hybridizing bands at 5 kb, 1.5 kb and 1.3 kb . 
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Mutant strain WiMH1734 had hybridizing bands at 4.9 kb, 

1.3 kb, 1.0 kb and 0.7 kb confirming the homologous 
integration of a single copy of the fragment from 
pWHM1422 at the lovC locus. The other mutant strain, 
5 WMH173 5, had a similar banding pattern but with 

additional hybridizing bands indicating that multiple 
integration events had occurred, one of which was at the 
lovC locus. 

Construction and characterization of the A . terreus 
10 strain with extra copies of lovE . 

A 10.4 kb No tl- EcoRI fragment containing the 

putative regulatory gene, lovE was subcloned from 

pWHM1263 to pSPORTl to give pWHM12 76 . From this plasmid 

a 3 . 9 kb Hindlll - BamHI fragment was subcloned into 

15 pGEM7 to give pWHM14 23 . The regulatory gene was 

subcloned from this vector into pPLOA as an SstI - XbaRI 
fragment to give pWHM1424 (Fig. 5). pWHM1424 contains 
nucleotides 30,055 - 33,000 from SEQ ID NO: 18 and 
nucleotides 1 - 1,026 from SEQ ID NO: 19. 

20 Extra copies of the regulatory gene were introduced 

into A . terreus ATCC20542 by transformation to Zeocin 
resistance with pWHM1424 . Transf ormants were fermented 
(method A) and screened for lovastatin production 
initially by TLC analysis. Most of the transf ormants 

25 appeared to be producing significantly more lovastatin 

than the wild- type strain. The yields of lovastatin from 
the two transf ormant strains, WMH1736 and WMH1737, which 
had the most elevated levels compared to the wild-type 
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was quantified by HPLC as described below. These were 

found to produce 7 -fold and 5- fold more lovastatin chan 
the A . cerreus ATCC20542 strain. 

Because of the way that the DNA integrates 
5 (ectopically) , each transformant is or can be unique, 

genotypically and phenotypically . However, some will be 
overproducers ; others may exhibit no difference, for 
unknown reasons . 

Heterologous expression of the lovastatin biosynthesis 
10 genes . 

To place the NPKS gene (lovB) under the control of 
the inducible alcA promoter, the 11.5 kb Kpnl - Avrll 
fragment from pTPKSlOO containing the NPKS open reading 
frame was ligated into pAL3 (Waring, et al . , Gene 79:119, 

15 1989) previously digested with Kpnl and Xjbal . The 

resulting plasmid was designated pAL3TPKS (WHM1425) . The 
polymerase chain reaction was used to amplify the NPKS 
gene sequence between the NPKS promoter region just 
upstream of the t ranslat ional start codon and a Agel site 

20 internal to NPKS. The design of the forward primer 

introduced a Kpnl site 31 bases from the t ranslat ional 
start codon allowing the NPKS to be placed against the 
alcA promoter but also incorporating upstream elements 
from the A. terreus system. Amplification was performed 

25 using Vent DNA polymerase with pTPKSlOO as template and 1 
/imol of each primer in a final volume of 100 /zl using the 
manufacturer's buffer recommendations. After an initial 
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dena curat ion cycle of 10 minutes at 9 5°C amplification 

was achieved with 3 0 cycles of 9 5°C for 1 minute; 5 5 C C for 

1 minute and 7 2 °C for 1-5 minutes . The final cycle was 

followed by 10 minutes at 72 °C to ensure complete 

5 polymerization. The amplified product (1.7 kb) was 

digested with Kpnl and Agel and ligated into pWHM1425 

that had been digested with the same enzymes and gel 

isolated . The resul t ing plasmid was designated 

pAL3TPKSNT (pWHM1426) . The region introduced by PCR was 

1 0 sequenced on a ABI automated DNA sequencer to ensure 
sequence fidelity. This plasmid was then used to 
transform A. nidulans strain A722 (Fungal Genetics Stock 
Centre) to uridine prototrophy . 

Transf ormant s were grown by inoculating 0.5 ml of 

15 spore suspension (10 s c.f.u./ml) in 50 ml YEPD in a 250 ml 
unbaffled flask. This was then grown for 20 hours at 250 
rpm and 3 7 °C (New Brunswick Scienti f ic Series 2 5 
Incubator Shaker) . The mycelia were then harvested by 
f il trat ion through Miracloth (Calbiochem) , rinsed with 

20 sterile, distilled water, and inoculated into fresh 2 50 
ml unbaffled flasks containing 50 ml AMM + lactose + 10 
mM cyclopentanone and grown for a further 2 0 hours under 
the same conditions . The mycel ia were harvested by 
filtration using Miracloth (Calbiochem) , squeezed as dry 

25 as possible and frozen in liquid nitrogen. Protein 

extracts for SDS-PAGE and western analysis were prepared 
as described in Kennedy and Turner, Molec . Gen . Genet . 
(1996), 253:189-197, 1996. 
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One transformant, WMH173 8, was shown to have a large 

protein (>2.00 kDa) visible on a SDS-PAGE gel that cross 

reacted with the affinity purified NPKS antibodies 

(Panlabs) . This strain WMH1738 was transformed to 

5 hygromycin B resistance with pWHM1263 . Transformant 

colonies were screened for lovastatin resistance and for 

the production of new metabolites as described below and 

two strains WMH1739 and WMH1740 were chosen for further 

analysis. Both of these strains were found to be 

10 significantly resistant (up to 100 jig/ml on solid media) 
to lovastatin compared with the host strain. This was 
analyzed by streaking 10 fil of a spore suspension on 
solid AMM plates containing lovastatin at 0, 0.1, 0.5, 1, 
5, 10, 50 and 100 /ig/ml and incubating at 37°C. Strains 

15 WMH1739 and WMH1740 were compared to strains WMH1741 and 
WMH1742 which were derivatives of WMH1738 transformed to 
hygromycin resistance with AN26. Strains WMH1739 and - 
1740 exhibited no inhibition of growth at any of these 
lovastatin concentrations whereas strains WMH1741 and - 

20 1742 showed slight inhibition of grown at 5 /ig/ml and 

almost complete growth inhibition at 50 ^g/ml . The two 
lovastatin resistant strains were fermented in 
lovastat in-producing conditions using fermentation method 
B and extracts were analyzed for lovastatin related 

25 metabolites as described below. Both strains were found 
to produce new metabolites. One compound that was common 
to both comigrated with monacolin J on TLC and HPLC 
analysis by the methods described below. Semi- 
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preparative KPLC was used to partially purify some of 

this compound, which was then analyzed by HPLC -MS. It 
had the same mass and fragmentation pattern as authentic 
monacolin J. The other compound, found in only one of 
5 the strains, comigrated with monacolin L on TLC and HPLC. 
METHODS 

Solid medium for growth of A. terreus 

For the generation of spore suspensions A. terreus 

strains were grown on CM agar at 30°C for 4 to- 5 days. 

10 CM Agar (for CM liquid medium the agar was omitted) : 
50 ml Clutterbuck's salts (Vinci, et al . , U.S. 

Patent No. 5,744,350) 

2 ml Vogel's trace elements (Vinci, e_t aJ., U.S. 
15 Patent No. 5,744,350) 

0.5% Difco Bacto tryptone 

0.5% Difco Bacto yeast extract 

1% glucose 

2% Difco Bacto agar 
20 in 1 liter of distilled water 

Clutterbuck' s salts : 

12% NaN0 3 
1.02% KC1 
1.04% MgSO, .7H 2 0 
25 3.04% KH 2 PO< 

Vogel's trace elements: 

0.004% ZnCl 2 
0.02% FeCl 3 
0.001% CuCl 2 
30 0.001% MnCl 2 *4H r O 

0.001% Na 2 B«0 7 - 10H 2 O 

0 . 001% (NH. ) e Mo-,0 2 . ■ 7H 2 0 

For long term storage A. terreus spores were 
suspended in SSS ( 10% glycerol , 5% lactose) and stored at 
35 -70°C. 
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For the generation of spore stocks A. nidulans was 

grown on the following solid growth medium (ACM) for 3 to 

4 days at 37°C. 

ACM : 

5 2% Difco Bacto malt extract 

0.1% Difco Bacto peptone 
2% glucose 

2% agar (Difco, Detroit, MI) 

For strains which required para-aminobenzoic acid 
10 (PABA) for growth, PABA was added to a final 

concentration of 1 /ig/ml . For strains which required 
uracil and uridine these were added at 20 mM and 10 mM, 
respectively. Spores were suspended in Tween 80 - saline 
solution (0.025% Tween 80, 0.8% NaCl) and stored at 4"C. 
15 AMM : 

0.6% (w/v) NaNOj 
0.052% (w/v) KC1 
0.152% (w/v) KH r P0 4 
0.052% (w/v) MgSO, - 7H.O 
20 1% (w/v) glucose 

0.1% (v/v) AMM trace elements solution 

pH to 6.5 and make up to 1 liter with distilled 

water . 

For preparation of plates 2% (w/v) Difco Bacto agar 
25 was added. If required the glucose can be omitted and an 
alternative carbon source (e.g., lactose added at the 
same concentration) . For the preparation of 
transformation plates KC1 was added at 4.47% (w/v) (0.6 
M) . 

30 AMM trace elements solution: 

0.1% (w/v) FeS0 4 *7H 2 0 
0.88% (w/v) ZnS0 4 -7H 2 0 
0.04% (w/v) CuS0 4 >5H 2 0 
0.015% (w/v) MnS0 4 -4H 2 0 
35 0.01% (w/v) Na 2 B,0 7 • 10H 2 O 
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0.005% (NK. ) s Mo,0 2 . * 7H 2 0 
disiillea wacer to 1 liter 

Large scale genomic DNA preparation from A . terreus for 
genomic library construction. 

5 A 2,5 ml aliquot of spore suspension (10 s c . f . u . /ml ) 

was used to inoculate 500 ml of liquid CM medium and 

grown for 20 hours at 30°C and 200 rpm. The mycelium was 

harvested by filtration through Miracloth (Calbiochem) 

and rinsed extensively with water then TSE [150 mM NaCl , 

10 100 mM Na ? EDTA, 50 mM Tris-HCl pH 8.0] . The mycelium was 
squeezed dry, broken into small pellets and frozen in 
liquid nitrogen then ground to a fine powder in a pre- 
chilled pestle and mortar followed by transferral to a 
500 ml flask. Fifty ml of extraction buffer [150 mM 

15 NaCl, 100 mM Na 2 EDTA, 50 mM Tris-HCl pH 8.0, 2% (w/v) SDS] 
and 10 ml of toluene was added to the flask which was 
shaken at 60 rpm for 72 hours. This mixture was 
centrifuged at 1000 x g for 15 minutes and the 
supernatant was removed and extracted with an equal 

20 volume of chloroform : isoamyl alcohol (24:1 vol/vol). 

This mixture was centrifuged at 10,000 x g for 30 minutes 
at 15°C. The aqueous layer was carefully removed and 1.1 
volumes of ethanol was layered on top. The DNA was 
spooled out from the resulting suspension and resuspended 

25 in 5 ml TE [10 mM Tris-HCl pH 8.0, 1 mM EDTA] + 50 /xg/ml 
RNase and 100 ^g/ml proteinase K then incubated at 37°C 
for 2 hours. The mixture was extracted again with 
chloroform : isoamyl alcohol (24:1) and the DNA was spooled 
out as before. Following resuspension in 1 ml of TE the 
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DNA was extracted once with phenol : chloroform : isoamyl 

alcohol (25:24:1, vol/vol)/ once with chloroform: isoamyl 

alcohol (24:1) and precipitated with 0.6 volumes 

isopropanol. The DNA clot was removed, dried briefly and 

5 resuspended in 0.5 ml TE . 

Small scale genomic DNA preparation from A. terreus for 
Southern blot. 

A 0.5 ml aliquot of spore suspension (10 s c . f . u . / ml ) 

was used to inoculate 100 ml of liquid CM and grown for 

10 20 hours at 30°C and 200 rpm. The mycelium was harvested 
by filtration through Miracloth (Calbiochem) and rinsed 
extensively with water then TSE [150 mM NaCl , 100 mM 
Na.EDTA, 50 mM Tris-HCl pH 8.0]. The mycelium was 
squeezed dry, broken into small pellets and frozen in 

15 liquid nitrogen. The mycelium was ground to a fine 

powder in a pre-chilled pestle and mortar and transferred 
to a mortar pre-heated to 65°C. Three ml of lysis buffer 
[0.5 M NaCl, 10 mM Tris-HCl pH 7.5, 10 mM EDTA, 1% (w/v) 
SDS] at 65°C was added and 0.3 ml of 10% (w/v) 

20 cetyl t rime thyl ammonium bromide in 0.7 M NaCl. After 
thorough mixing to form a slurry, 3 ml of 
phenol : chloroform : isoamyl alcohol (25:24:1) was added. 
This mixture was transferred to a Corex tube and 
incubated at 65°C for 15 minutes. Following 

25 centrif ugation at 12,000 x g for 15 minutes at 4°C the 

aqueous phase was carefully removed and re-extracted once 
with phenol, once with phenol : chloroform : isoamyl alcohol 
(25:24:1) and once with chloroform : isoamyl alcohol 
(24:1) . The DNA was precipitated from the extract by 
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addition of 0.1 volume of 3 M sodium acetate pH 5 and 0.6 

volumes isopropanol then collected by cent ri ruga t ion 
(10,000 x g, 10 minutes, 4°C) . After washing with 70% 
ethanol the pellet was briefly dried and resuspended in 
5 TE + RNase (50 pig/ml) . 

Transformation of A. terreus . 

A 0.5 ml aliquot of spore suspension (10 s c.f.u./ml) 
was used to inoculate 100 ml of liquid CM and grown for 
20 hours at 30°C and 200 rpm. The mycelium was harvested 

10 by centrif ugat ion at 2000 x g for 15 minutes at 4 3 C and 
washed twice with an aqueous solution containing 0.27 M 
CaCl 2 and 0.6 M NaCl. To produce protoplasts the washed 
mycelia was resuspended in 20 ml of the same solution 
containing 5 mg/ml Novozym 234 (NovoNordisk) and 

15 incubated at 30°C for 1-3 hours with gentle agitation. 
Protoplasts were separated from undigested mycelia by 
filtration through Miracloth (Calbiochem) . The 
protoplast suspension was diluted with an equal volume of 
STC1700 [1.2 M sorbitol, 10 mM Tris-HCl pH 7.5, 35 mM 

20 NaCl] and incubated on ice for 10 minutes. The 

protoplasts were collected by centrif ugation (2000 x g, 
10 minutes, 4°C), washed with STC1700 and resuspended in 
1 ml STC1700. Plasmid DNA, purified using Qiagen 
columns, (2-5 /zg in 10 ^1) was added to 150 /il of 

25 protoplast suspension and incubated at room temperature 
for 25 minutes. PEG solution [60% (w/v) polyethylene 
glycol 4000, 50 mM CaCl 2 , 10 mM Tris-HCl pH 7.5] was added 
to the DNA/protoplasts mixture in three steps: 250 /zl, 
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250 /ii, and 850 pi with mixing after each addition. The 
suspension was incubated at room temperature for 25 
minutes then diluted to 10 ml with STC1700. Protoplasts 
were collected by cent ri fugat ion as above and diluted 
5 with 500 fil STC1700. 100 /il aliquots of this mixture 

were plated onto osmotically stabilized plates [CM medium 
containing 3% (w/v) Difco Bacto agar and 23.4% (w/v) 
mannitol, 15 ml of agar per plate] . After 4 hours growth 
at 30°C / 25 ml of OL agar [1% (w/v) Difco Bacto peptone, 

10 1% (w/v) Difco Bacto agar, 200 pg/ml Zeocin] was 

overlayered onto each dish. The plates were incubated 
for 3-4 days at 30°C before transformant colonies were 
picked. These were streaked to single colonies twice on 
selective media (CM + 100 /ig/ml Zeocin) before spore 

15 suspensions were prepared. 

Transformation of A. nldulans . 

A 0.5 ml aliquot of spore suspension (10 8 c.f.u./ml) 
was used to inoculate 100 ml of YEPD [2% (w/v) Difco 
Bacto yeast extract, 2% (w/v) glucose, 0.1% Difco Bacto 

20 peptone] liquid medium including necessary supplements 

and grown for 20 hours at 37°C and 200 rpm. The mycelia 
was harvested by cent ri fugat ion (2000 x g, 10 minutes, 
4°C) and washed twice with 0.6 M KC1 . To generate 
protoplasts the mycelia was resuspended in 20 ml of 0.6 M 

25 KC1 containing 5 mg/ml Novozym 234 and incubated at 30°C 
for 1-2 hours with gentle shaking. Protoplasts were 
separated from undigested mycelia by filtration through 
Miracloth (Calbiochem) . The protoplasts were harvested 
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by centrif Ligation as described above and washed twice 

with 0.6 M KC1 , then resuspended in 10 ml 0.6 M KC1 + 50 

mM CaCl 2 . After counting in a haemocytomet er the 

protoplasts were harvested by cent ri fugat ion as before 

5 and resuspended to a final concentration of 5 x 10 s 

protoplasts/ml . To 50 /il of protoplast suspension, 5 fil 

of DNA (2-5 jig, purified using Qiagen columns) was 

added, then 12.5 /il of PEG solution [25% (w/v) PEG 6000, 

50 mM CaCl 2 , 10 mM Tris - HC1 pH 7.5] and the mixture was 

10 incubated on ice for 20 minutes. A further 0.5 ml of PEG 
solution was added and the mixture was incubated on ice 
for a further 5 minutes. A 1 ml aliquot of 0.6 M KC1 +■ 
50 mM CaCl 2 was added and the protoplasts were plated out 
in 50 /il, 200 /il, and 400 /il aliquots. For 

15 transformation to uridine prototrophy, protoplasts were 
plated out onto AMM + 0.6 M KCl plates without adding 
uridine or uracil supplements.- Plates were incubated at 
37°C for 3-4 days when transf ormant s were picked. For 
transformation to hygromycin B resistance protoplasts 

20 were plated out onto AMM + 0.6 M KCl plates (15 ml) and 
incubated for 4 hours at 30°C. 30 ml of 1% peptone, 1% 
agar, 1 mg/ml hygromycin B was then used to overlay the 
plates, which were incubated for 3-4 days when 
transf ormant s were picked. Transf ormant s from both 

25 methods were streaked out to single colonies on selective 
media (i.e., lacking uridine/uracil supplements or 
containing 1 /ig/ml hygromycin B) twice before spore 
suspensions were made. 
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Analysis of strains for lovastatin production. 



Two fermentation methods were used for the analysis 



of lovastatin production. In Method A, 0 . 5 ml of spore 
suspension (10 8 c . f . u . /ml ) was inoculated into 25 ml of 
5 SEED medium in 250 ml unbaffled flasks and grown for 18 

hours at 250 rpm and 30°C (New Brunswick Scientific Model 
25 incubator/shaker) . A 1 ml portion of the resulting 
seed culture was used to inoculate 25 ml of FM in a 250 
ml unbaffled flask and grown for 6 days in the conditions 

10 described above. Fermentation Method B involved 

inoculating 50 ml of RPM in a 250 ml unbaffled flask with 
0.5 ml of spore suspension (10 8 c.f.u./ml) and growing at 
30°C and 250 rpm for 7 days in a New Brunswick Scientific 
Series 25 Incubator Shaker. 

15 SEED medium: 



20 



0.5% (w/v) Sigma corn steep liquor 
4% (w/v) tomato paste 
1% (w/v) oat flour 
1% (w/v) glucose 

1% (v/v) Vogel ' s trace elements 
distilled water to 1 1 



FM: 



25 



4.5% (w/v) glucose 

2.4% (w/v) Sigma peptonized milk 
0.25% (w/v) Difco Bacto yeast extract 
0.25% (w/v) polyethylene glycol 2000 
distilled water up to 1 1 



RPM: 



30 



35 



4% (w/v) lactose 

0.3% (w/v) rapeseed meal 

0.2% (w/v) KN0 3 

0.3% (w/v) KH 2 P0 4 

0.05% (w/v) MgSCV7H 2 0 

0.05% (w/v) NaCl 

0.05% (v/v) Sigma antifoam B 

0.05% (v/v) trace elements solution 

pH to 6.5 and made up to 1 1 with distilled water. 
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Trace elements solution is: 

0.16% (w/v) MnSO« 
0.34% (w/v) ZnSCL - 7H 2 0 
0.2% (w/v) CoCl 2 -6H 2 0 
5 0.5% (w/v) FeS0 4 -7H 2 0 

made up to 1 liter with distilled water. 

The cultures were extracted by adjusting the pH of 

the media to 3 with HCl , adding an equal volume of ethyl 

10 acetate, and shaking the mixture on a New Brunswick 

Scientific Series 25 incubator/shaker at 250 rpm for 2 
hours. For analysis, 1 ml of the ethyl acetate layer was 
dried under a nitrogen stream and resuspended in 0.1 ml 
of methanol. For TLC analysis 10 /il of this extract was 

15 run on C-18 reverse phase TLC plates (RP-18 F 254 - Merck) 
in a solvent system of methanol : 0 . 1% phosphor-ic acid 
(9:1) . TLC plates were developed by spraying with 10% 
phosphomolybdic acid in methanol and heating with a heat 
gun. Extracts were compared with authentic lovastatin, 

20 monacolin J, monacolin L, and dihydromonacol in L (acid 

and lactone forms) . For HPLC analysis a Waters Nova-Pak 
C 18 (3.9 x 150 mm) column was used with a solvent system 
of acetonitrile (B) and 0.1% phosphoric acid (A). The 
column was eluted with a preprogrammed gradient of 0 to 

25 100% B into A over 25 minutes using gradient 7 (Waters 
Millenium Software) with a flow rate of 1.5 ml/min and 
metabolites were detected with a Waters 996 Photodiode 
Array Detector; lovastatin was detected at 238 nm. For 
purification of metabolites a Waters Prep Nova-Pak HR C 18 

30 (7.8 x 300 mm) column was used. The same solvent system 
as above was used with gradient of 0 to 100% B in A over 
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75 minutes at a flow rate of 4.5 ml/min. Fractions were 

collected manually, back extracted with ethyl acetate and 

dried. For HPLC-MS an Aquapore OD-300 7 micron (1.0 x 

100 mm) column was used with a gradient of 0 to 100% 

5 acetonitrile into A (0.05% TFA) over 30 minutes at a flow 

rate of 0.02 ml/min. 



-41- 



BNSDOCID: <WO 0037629 A2 I > 



WO 00/37629 

We claim: 



CLAIMS 



PCT/US99/29583 



1. A method of increasing the production of 
lovastatin in a lovas tat in-producing organism, comprising 
the steps of transforming the organism with the D4B 
segment, wherein the segment is transcribed and 

5 translated, and wherein an increase in lovastatin 
production occurs. 

2 . The method of claim 1 wherein the D4B segment 
is the A . terreus D4B segment. 



3. The method of claim 1, wherein the D4B segment 
is identical to nucleotides 579 - 33,000 of SEQ ID NO: 18 
and 1 - 5,349 of SEQ ID NO : 1 9 . 

4. The method of claim 1, wherein the lovastatin- 
producing organism is selected from the group consisting 
of A. cerreus ATCC 20542 and ATCC 20541. 

5. The method of claim 1, wherein the organism is 
selected from the group consisting of fungi and yeast. 

6. The method of claim 1 wherein the increase is 
at least 2-fold. 
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7. The method of claim 1 wherein the nucleic acid 

sequence is identical to a sequence isolated from ATCC 
98876 . 

8. The method of claim 1 additionally comprising 
transforming the organism with the entire A. terreus 
lovastatin gene cluster. 

9. The method of claim 8 wherein the gene cluster 
comprises SEQ ID NOs:18 and 19. 

10. The method of claim 8 wherein the nucleic acid 
sequence of the gene cluster is identical to sequences 
isolated from ATCC 98876 and 98877. 

11. A method of increasing the production of 
monacolin J in a lovastatin- producing organism, 
comprising the steps of transforming the organism with 
the D4B segment, wherein the segment is translated, and 

5 wherein an increase monacolin J production occurs. 

12 . A method of increasing the production of 
lovastatin in a lovastat in-producing organism, comprising 
the step of transforming the organism with the LovE gene, 
wherein the nucleic acid sequence is translated, and 

5 wherein an increase in lovastatin production occurs. 
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13 . The method of claim 12 wherein the increase is 

at least 2.0-fold. 



14 . The method of claim 13 wherein the increase is 
at least 5-fold. 



15. The method of claim 12 wherein the nucleotide 
sequence of the LovE gene comprises SEQ ID NO: 27. 



16. A method of increasing the production of 
lovastatin in a lovastat in-producing organism comprising 
the steps of transforming the organism with a nucleic 
acid sequence comprising a truncated version of the A. 

5 terreus D4B segment, wherein the nucleic acid sequence is 
transcribed and translated and wherein an increase in 
lovastatin production occurs. 

17. A method of increasing the production of 
lovastatin in a lovastat in-producing organism comprising 
the steps of transforming the organism with a nucleic 
acid sequence comprising a truncated version of the A . 

5 terreus lovastat in-producing gene cluster, wherein the 
nucleic acid sequence is transcribed and translated and 
wherein an increase in lovastatin production occurs. 
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18. A method of increasing or conferring the 
production of monacolin J in a non- lovastat in-producing 
organism comprising the steps of transforming the 
organism with a nucleic acid sequence comprising the D4B 

5 segment, wherein the nucleic acid sequence is transcribed 
and translated and wherein an increase in monacolin J 
production occurs . 

19. The method of claim 18 wherein the D4B segment 
is the A. tejrjreus D4B segment. 

20. The method of claim 18 wherein the D4B segment- 
comprises nucleotides 579 - 33,000 of SEQ ID NO: 18 and 1- 
5,349 of SEQ ID NO : 1 9 . 

21. The method of claim 18 additionally comprising 
the step of converting the monacolin J into lovastatin. 

22 . The method of claim 18 additionally comprising 
the step of transforming the organism with a nucleic acid 
sequence comprising the LovF gene, wherein the nucleic 
acid sequence is transcribed and translated and wherein 

5 an increase in lovastatin production occurs. 

23 . An isolated nucleic acid sequence selected from 
the group consisting of SEQ ID NOs:20 - 36. 
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24. A lovastat in-producing organism, wherein the 

organism has been genecically modified to have increased 
lovastatin production, wherein the increase is at least 
2-fold. 



25. The organism of claim 24, wherein the organism 
is a yeast or a fungi. 

26. A non- lovastat in producing organism, wherein 
the organism has been genetically modified to produce 
monacolin J. 



27. The organism of claim 26, wherein the organism 
is a yeast or a fungi . 



28. A non- lovastat in producing organism, wherein 
the organism has been genetically modified to produce 
lovastatin . 

29. The organism of claim 28 wherein the organism 
is a yeast or a fungi. 
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<1 40> 
<141> 

<160> 36 

<I70> Patentln Ver. 2.0 

<210> 1 
<2il> 1529 
<212> PRT 

<213> Aspergillus terreus 
<4GC> 1 

Met Ala Ser Leu Leu Phe Phe Thr Val Phe Asn Leu Thr Leu Ala Leu 
1 5 10 1 5 

Leu Ser Ser Thr Ala Thr Gly Ala Ala Val Pro Val Ser Arg Pre Thr 
20 25 30 

Asp Asp Ser Arg Tyr lie Asp Phe Asp Ala Ala Glu Trp Arg Pro Arg 
35 40 45 

Ala Lys Arg Asp Asp Ala Leu Lys Val Pro Leu Arg lie Leu Pro Leu 
50 55 60 

Gly Ala Ser lie Thr TrD Gly Tyr Leu Ser Ser Thr Gly Asn Glv Tyr 
65 70 75 " 80 

Arg Lys Pro Leu Arg Asp Lvs Leu Arg Phe Glu Gly Trp Glu Val Asp 
85 90 95 

Met Val Gly Lys Ala His Ser Gly Asp Val He Thr Gin Val Gin Thr 
100 105 110 

Ala Ala Ala Asn Ser Leu Ala Tyr Lys Pro Asn Val Val Leu He Asn 
115 120 125 

Ala Gly Tnr Asr. Asp Cys Asp Tyr Asn Vai Asp Pro Ala Asn Ala Gly 
130 135 140 

Glu Arg Met Arg Ser Leu. He Glu Thr Leu He Gly Ala Pro Asp Met 
145 150 155 16C 

Ala Asn Thr Leu He Val Leu Ser Thr Leu He Pro Ser Gly Ser Thr 
165 170 175 

Thr Leu Glu Ala Asn Arg Pro Ser Val Asn Ala Gin Phe Arg Glu Leu 
180 185 190 

Val Leu Asp Met Arg Glu Ala Gin Asn Val Ser He Val Leu Ala Asp 
195 200 205 

Met Asp Pro Pro Ala Pro Ser Pre Glv As n Asn Trp He Thr Tyr Pro 
210 215 " 220 
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225 P ~- ~ iS HiS rro Asr ' A :- D 'yr Gly Tyr Ser Gin 

2j - 240 

Me, Ma Asp He Trp T yr Asn Ala Iie Tyr Asn Ala Ma Val Ma Glu 



25C 255 
260 ~ y L6U Ile Ser Ser Thr Gly Thr Cys 



Leu Ile Val Lys Pre Ala Asc 

265 



Asp Lys Gl u Tvr Glv ^ r r'w t-. « - ^- 

275 ' V 2sJ yr Giy Gly Ph f Thr Gin G1 " 

2 8 d 

Giy Ser Gly ciu Asp flsp ^ Ug Tyr Arg ^ ^ ^ ^ 

300 

Gly Ala Leu Phe Thr Val Ar g Ala Gly Lys Gly Ala Ala Asp Pro T yz 



320 

Lys Asp Asp Aso Glu Leu H^s PhP Ph^ dk. 

* 325 e PhG G1 V Ar 9 Leu Tyr Thr Arg 

330 335 
Ala Tyr Asp Asp Met Met lie Phe Hi^ t w«= t 

340 V P Lys Asp Ser G1 V AJ -a 

J4b 350 



val Thr Phe Val Ser Tyr Thr A Asn Val Thr Glu ^ ^ 

Jb0 365 

Phe Thr Lys Gly Gly Thr Phe s P r Thr « 

370 Y £J? Ser Thr Hls Asn Asn Cys Asn Pro Gly 

J/b 380 Y 

Gly Val His P he lie Asp Ile Asn Gly Asp G ly Leu Asp Asp Tyr Ile 

395 400 
Cys Ile Ma Leu Asp Gly Thr Thr Tyr Ala Ser lie Asn Asn Gly Asp 



Gly Asp Ma Lys Ser Asn Lys Pro Pro Ser Phe Thr Asp lie Gly Leu 

qzb - ' 430 



Trp Lys Ser Pro Glu Gly Tyr Asp Gin Ma His Val Ar g Leu Ala Asp 

440 445 

He Asp Gly Asp Gly Arc Ala Asp Tyr Cys Gly Leu Ma Asp Asn Gly 

^ b 460 y 

Asp Val Thr cys Trp Asn Gly ^ ^ ne a ^ ^ 

475 480 

Trp Gin Pro Leu Gly Lys Arg Phe Thr Gly Lys Val Met Gly Asp Leu 



4 90 4 9 - 5 



iot G1U ASP 116 ?»" G ^ **P Gly Arg Asp Aso Trp 

505 510 

Met Trp Val Asp Asp Asp Gly Ma Thr Thr Thr Tyr Thr Asn Ser Arg 



525 



Ser Cys lie Lys Gly Glu ser Gly Asp Gly L eu Asn Val Val Tro Arg 

540 ~ 



Gin Gly Phe Tyr Gln Ma Asn Ser Giy Pro Ser His Pro Gly 

Gly Val lie Phe Gly Thr Ser Gly Leu Aro Asp Gin Val Tyr ph 
56j - 7 0 57" 5 



e Al; 
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Arg Leu Tyr Glv Glu Val Ala Asp Phe Gly GIu Leu Gly Arg Gin Asp 
580 585 590 

Tyr Val Phe lie Lvs Lvs Asp Thr Ser Asd Lys Tyr Phe Gly Pre Leu 
595 " " 600 605 

Tyr Tyr Val His Val Trp Lvs Ser Lvs Glv Ala Gly Gly Ala Lys lie 
610 615 " - 620 

Lys Ala Asp Giy Asc Arg Tyr Cys Asn Met Met Gly His Asp Asn Gly 
625 1 630 635 640 

Met Met Asp Tvr lie Trp lie His Ser Thr Gly His Met Arg Leu Tyr 
645 650 655 

Pro Asn Arg Glv Leu Val Glu Val Pro Ala Asp Gly Ser Ser Phe Trp 
660 665 670 

Gly Ala Asn Glu lie lie Phe Asp Pro-Gin Glu Gin lie Giy Met Lys 
675 680 685 

Leu Asp Arg Arg Asp Leu His Leu Ala Asp Trp Asp Gly Asp Gly Ala 
690 695 700 

Cys Asp lie lie Trp Thr Asp Pro Asp Asn Leu Asn Arg Ala Gin Val 
705 710 715 720 

Trp Arg Asn Lys lie Lys Asp Thr Gly Ser Phe Asp Trp Asp Tyr Asn 
725 730 735 

lie Asn Ala Ala Asp Glu Leu Tyr Cys Pro Glu His Arg Giy Leu Gly 
740 745 750 

Phe Phe Asp Arg Pro Val His Phe Ala Asp Val Ser Gly Asn Gly Lys 
755 760 765 

Ala Asp Tyr Leu Cys Val Glu Lys Asp Gly Arg Thr Trp Gly Trp Val 
770 775 780 

Asn Gly Asp Asp Gly Trp Asp Tyr lie Asp Gin Phe Lys Tyr Ser Glu 
785 790 795 800 

Glu Lys Asp Arg Ala Asn Leu His Trp Ala Asp Val Asn Gly Asp Gly 
805 810 815 

Lys Ala Asp Met lie Trp Thr Asp Lys Phe Ser Gly Asp Gly Ser Val 
820 825 830 

Trp Tyr Asn Leu Gly Gin Arg Asp lie Lys Gly Ser Arg Tyr Glu Trp 
835 840 845 

Gly Pro Gin Gly Pro Lys Tyr Arg Gly Ala Val Glu Gly Ser Cys Thr 
850 855 860 

Tyr Phe Pro Asp Leu Asn Giy Asp Gly Arg Ala Asp Met His Ser He 
865 870 875 880 

Trp Asn Ser He Asn Asn Thr Ala Gin Thr Trp Tyr Asn Glu Cvs Ala 
885 890 895 

Thr Lys Asp His Thr Gly Asp Asp Gly Pro He Thr Asn Pro Asn Leu 
900 905 910 

Pro Val Ser Pro Val Lys Ala Pro He Glu Leu Thr Pro His Tyr Gin 
915 920 925 
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Asp Asn Ser Glu Cys Thr Arg Aia Gin vai Gin Thr Leu Phe Glu Glu 

935 940 

Met Gin Tyr Aia Leu Asp Aia Ala Ser Glu Val Ala Tyr Phe Ser Gly 
945 950 955 9 60 

Gly Ala Tyr Asp Pro Tyr Arq Asp lie Phe Phe Aia Glu Ser Leu Thr 
965 970 975 

Asp Ser Leu Thr Phe Thr lie Asn Val Arg Tyr Thr Phe Asd Arq Met 
560 985 99 q 

Val Thr Met lie Ser Gly Ser Ser Gin Phe Asp Asp Glu Lys Phe T hr 
555 1000 1005 

Il€ M?n CyS LyS ASn Leu Arg Gly As P Glu Asn Gl y T ^P Leu Ala 

1010 1015 1020 

Met Met Asn Asn Arg Asn Arg Leu Asn Phe Cys Pro Lys Phe Phe Thr 
1025 1030 1035 1040 

Asp Glu Leu Lys Ser Ser Arg Arg Thr Arg Asp Tyr Val Tyr Glv Tro 
1045 1050 1055 

Lys Gly Aia Arg Asp Leu Aia Aia Gly Thr Phe Asn Arg His Cys lie 
1C60 1065 1070 

Glu Arg Gly Arg Lys Ala Glu Arg Ala Ala Asn Glu Leu Arg lie Ala 
1015 1080 1085 

Gly Asp Ala Asn Trp Gin Arg Arg Leu Leu Cys Pro Asp Pro Asn Asn 
1090 1095 iioo 

Leu Gly Gin Glu Giy lie Cys Asp Ser Lys Leu Ser Ala Tyr Asn Ala 
1105 IHO ins 112 o 

Asp Ser Trp Aia Leu Val Val Leu Gly Giy Tyr Tyr Thr Lys lie Cvs 
!125 H30 H35 

Gly Arg Gin lie Pro Leu Pro Glu Glu Ser Ala Ser Ser Ala Asp Asd 
11^0 H45 H50 

Ser Ser Cys Pro Ala Tyr Asp Asp Ser Ser Tyr Asp Ala Asp Thr Val 
1155 1160 1165 

Tyr Giy Val Asn Asp Tyr Val His Phe Gly Asp Ser Tyr Aia Ala Gly 
1 i 70 1175 1 8 0 

Met Gly Thr sly Thr Thr Thr Giy Asp Ser Cys Arg Val Gly Ser Asn 
1185 -190 H95 120Q 

Ser Tyr Gly Lys Leu Val Gin Glu Trp Phe Asp Thr Glu Asp Phe Thr 
1205 1210 1215 

Tyr Thr Asn Tyr Ala Cys Ser Gly Asp Thr Thr Val Gly Leu Asn Ly= 
1220 1225 1230 

Lys He Asp Gin Trp Leu Gly Gin Asp Fro Thr Gly Thr Thr Met Ala 

i2"30 1245 

Thr 1 tf^ Thr :le Gi V C1 'S Asn A SP '-'al Phe Phe Ser Asp Leu Val Ser 

^255 1260 

Asn Cys Val Leu Thr Met Trp Trp Tyr Ser Leu Glu Gin Tyr Arg Gin 
- 265 12/C 1275 1280 
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Trp Cys Leu Glu Thr Glu Glu Lys Ala Arc Asn Leu Met Gin Asp Thr 
12 8; 129 C 121- z 

Glv Ser Asp Giv Leu Glv Ser Lvs Leu Arc Ala Aia Tvr Glu Lvs lie 
* 1300 " " 1305 ' " 1310 

Leu Asp Ara Ser Glv Ser Ser Val Tvr Leu Pro Val lie Leu He Tyr 
1315 132C "* 1325 

Ser Cys Arg Ala Val Leu Ara Ara Ala Aso Phe Thr Leu Val Val Gin 
1330 1335 " * 1340 

Pro Leu Arg Pro Trp Leu Cys His Leu Leu Gin Aro Arg His His Ara 
1345 1350 1355 " 1360 

Leu Arg Leu Asn .His Leu Leu Glu Leu Asn Asd Leu Val Arg Met Leu 
1365 1370 1375 

Asn Ser Leu He Gin Ser Thr He Ser. Asp He Asn Thr Ala Arc Asn 
1380 1385 1390 

Thr Glu Gin He His Tyr lie Asp Met Asp Ala Arg Phe Asp Glv His 
1395 1400 1405 

Arg Trp Cys Glu Pro Gly Thr Gin Glu Pro Asp Pro Asp Asn Pro Asn 
1410 1415 1420 

Thr Tyr Phe Phe Leu Ser Ala Trp Pro Asp lie Ala He Val Glv Asp 
1425 1430 1435 ' 1440 

Thr Thr Ala Glu Ser Thr Asn Ala Thr Glu Thr Asp Glu He Thr Ala 
1445 1450 1455 

Leu Met Asn Ser Gly Ser lie Gin Leu Pro Asp Ala Asp Thr Cys Gin 
1460 1465 1470 

Asp Ala Leu Gly Ser Asp Pro Asp Pro Tyr Aia Val Phe Met Cys Asp 
1475 1480 1485 

Val Ala Val His Val Lys Ala Asn Ser Ser Ser Leu lie Ala Gin Ser 
1490 1495 1500 

Leu Asp Arg Ala Asn Gin Ala lie Ala Asn Arg Asp Tyr Ser Ser Gin 
1505 1510 1515 1520 

Asp Val Ser Trp Trp Leu Pro Ser Pro 
1525 

<210> 2 
<2H> 328 
<212> PRT 

<213> Aspergillus terreus 
<400> 2 

Met Thr Leu Pro Thr Leu Pro Asn Trp lie Arg Met Cys Val His Leu 
15 10 15 

Ser Leu Thr His Leu His Gin His Arg Ser Pro Lys Tyr Glu Ser lie 
20 25 30 

Pro lie Lys Ser lie Gin Ala Asn Ser His Arg lie Leu lie lie Leu 
35 40 45 

Thr Thr Ala Ser Phe Tyr Pro Gin lie Arg Cys lie Gin Leu Arc: Asn 
50 55 60 
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Ser Thr His Gly He Ser Thr Ala Tyr He Leu Phe Asn Leu lie Ser 
65 7 5 -,80 

Ala Thr Giu His Phe Thr He Leu Phe Ala Leu Leu Vai Asr Se- Gi w 
65 90 95 

Gly Asp Vai Leu lie His Giu Pre Pro Thr Thr Gly Aso Gly Leu A«n 
100 105 " 110 

Leu Tyr Gl.n Leu Phe Ala Vai Trp Met Gly Cys Leu Vai Leu Phe Cys 
115 120 125 

Gin Ala He His Ser Leu His Ala Asn Pro Ara Arg Lvs Leu lie Leu 
130 135 " 140 

Leu Thr He Tyr He Gin Tyr Leu Cvs He Ser lie Leu Pro Giu Vai 
1^5 150 155 160 

He Asp Ala lie Thr Thr Pro Giu Giu Thr Ara Lys Gin Ara Pro' Pro 
165 170 " " 175 

Thr Gly Giu Arg Asn Trp Leu He Gly Leu Phe Leu Ser Ala His Ala 
180 185 190 

Met Thr Vai Leu Pro Leu Ser Ala Vai Leu Arg lie Ala Gly Phe lie 
195 200 205 

Asp Gin Ser Arg Leu lie Ser Arg Arg Arg Arg Giu Gin Pro Ser Vai 
210 215 220 

Leu Ser Leu Thr Gly Leu Ala Cys Gin Ala Vai Vai Phe Ala Leu Vai 
225 230 235 240 

Ser Gly Leu Trp Vai Leu Arg Vai Gin Gin Pro Vai Pro Arg Met Pro 
245 250 255 

Met Arg Arg Pro Vai Asp Trp Met Tyr Trp Tyr His Vai lie Gly Trp 
260 265 270 

Pro Vai Vai Asp Asp Ala Vai Tyr Ala Leu Gly Gin Trp Vai Leu Phe 
275 280 285 

Trp Tyr Ala Vai Cys Trp Arg Ser Arg Gly Asp Ala Arg Asp Giu Ala 
290 295 300 

Vai His Ala Gly Giu Thr Asp Asp Leu Leu Gly Giu Asp Giu Glv His 
305 310 315 " 320 

Gly Tyr Gly Gly Thr Gly Thr Ser 
325 

<210> 3 
<211> 323 
<212> PRT 

<213> Aspergillus terreus 
<400> 3 

Met Vai Gly Ser Lys Leu Ala His Asn Giu Giu Trp Leu Asp lie Ala 
1 5 10 15 

Lys His His Ala Vai Thr Met Ala lie Gin Ala Arg Gin Leu Arq Leu 
20 25 30 

Trp Pro Vai lie Leu Arc Pro Leu Vai His Trc Leu Giu Pro Gin Gly 
35 40 ' 45 
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Ala Lys Leu Arc Ala Gin Vai Aro Ara Ala Ar g Gin Leu Leu Asp Pro 
50 - 55 " 6G 

lie lie Gin Giu Arg Arq Ala Glu Arc Asc Ala Cvs Aro Ala Lvs Giy 
65 70 75 " " " 80 

He Glu Pro Pro Arg Tyr Val Asp Ser lie Gin Trp Phe Giu Asp Thr 
85 90 95 

Ala Lys Gly Lys Trp Tyr Aso Ala Ala Gly Ala Gin Leu Ala Met Asp 
100 105 110 

Phe Ala Gly He Tyr Gly Thr Ser Asp Leu Leu lie Gly Glv Leu Val 
115 120 125 

Asp He Val Arg His Pro His Leu Leu Giu Pro Leu Arg Asp Glu He 
130 135 140 

Arg Thr Val He Gly Gin Gly Gly Trp Thr Pro Ala Ser Leu Tyr Lys 
145 150 155 160 

Leu Lys Leu Leu Asp Ser Cys Leu Lys Glu Ser Gin Arg Val Lys Pro 
165 170 175 

Val Glu Cys Ala Thr Met Arg Ser Tvr Ala Leu Gin Asp Val Thr Phe 
180 185 190 

Ser Asn Giy Thr Phe lie Pro Lys Giy Glu Leu Val Ala Val Ala Ala 
195 200 205 

Asp Arg Met Ser Asn Pro Glu Val Trp Pro Glu Pro Ala Lvs Tyr Asp 
210 215 220 

Pro Tyr Arg Tyr Met Arg Leu Arg Glu Asp Pro Ala Lys Ala Phe Ser 
225 230 235 240 

Ala Gin Leu Glu Asn Thr Asn Gly Asp His lie Gly Phe Gly Trp His 
245 250 255 

Pro Arg Ala Cys Pro Gly Arg Phe Phe Ala Ser Lys Glu lie Lys Met 
260 265 270 

Met Leu Ala Tyr Leu Leu lie Arg Tyr Asp Trp Lys Val Val Pro Asp 
275 280 285 

Glu Pro Leu Gin Tyr Tyr Arg His Ser Phe Ser Val Ara lie His Pro 
290 295 300 

Thr Thr Lys Leu Met Met Arg Arg Arg Asp Glu Asp lie Arg Leu Pro 
305 310 315 320 

Gly Ser Leu 



<210> 4 
<211> 256 
<212> PRT 

<213> Aspergillus terreus 
<400> 4 

Met Arg Tyr Gin Ala Ser Pro Ala Leu Val Lys Ala Pro Arg Ala Leu 
15 10 15 

Leu Cys He His Gly Ala Gly Cys Ser Pro Ala He Phe Arg Val Gin 
20 25 30 
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Leu ser Lys Leu Arg Ala Ala Leu Arc GIu Asn Phe Glu Phe Val Tyr 

Val Thr Ala Pro Phe Pro Ser Ai - r^w r-i -n 

5G ~ y Pro G1 y lle Leu p ro Val 

- 5 60 

Phe Ala Asp Leu Giy Pro Tyr Tyr Ser Trp Phe Glu Ser Ser Ser Asp 

7w ^ 80 

Asn Asn Krs Asn Gly Pro Ser Val Ser Glu Arg Leu Ala Ala Val Mrs 

90 9 5 

Asp Pro lie Arg Arg Thr He Val Asp Trp Gin Thr Gin His Pro His 

He Pro lie Val Gly Ala He Gly Phe Ser Glu Gly Ala Leu Val Thr 

12G 125 

Thr Leu Leu Leu Trp Gin Gin Gin Met- Gly His Leu Pro Trp Leu Pro 

l 2 4 0 

Arg Met Ser Val Ala Leu Leu He C ys Pro Trp Tyr Gin Asp Glu Ala 

155 160 
Ser Gin Tyr Met Arg Asn Glu Val Met Lys Asn His Asp Asp Aso Asn 

170 17 - 5 

Asp Ser Lys Asp Thr Glu Trp Gin Glu Glu Leu Val He Arg He Pro 

185 190 

Thr Leu His Leu Gin Gly Arg Asp Asp Phe Ala Leu Ala Gly Ser Lys 

200 205 
Met Leu Val Ala Arg Hrs Phe Ser Pro Arg Glu Ala Gin Val Leu Glu 

Phe Ala Gly Gin His Gin Phe Pro Asn Arg Pro Arg Asp Val Leu Glu 

235 240 
Val lie Asn Arg Phe Arg Lys Leu Cys Val Thr Ala Gin Thr Leu Glu 

25 ° 255 



<2I0> 5 
<211> 363 
<212> PRT 

<213> Aspergillus terreus 
<400> 5 

Met Gly Asp Gin Pro Phe lie Pro Pro Pro Gin Gin Thr Ala Leu Thr 

Val Asn Asp His Asp Glu Val Thr Val Trp Asn Ala Ala Pro Cys Pro 

25 30 



Met Leu Pro Arg Asp Gin Val Tyr Val Arg Val Glu Ala Val Ala lie 

40 45 

Asn Pro Ser Asp Thr Lys Met Arg Gly Gin Phe Ala Thr Pro Trp Ala 

" 60 
Phe Leu Gly Thr Asp Tyr Ala Gly Thr Val Val Ala Val Gly Ser Asp 



Val Thr His He Gin Val Gly Asp Arg Val Tyr Gly Ala Gin Asn Glu 

" 90 



80 

G 

95 
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Met Cvs Pro Arq Thr Pro Asd Gin Glv Ala ?he Ser Gin Tvr Vai 
10C 205 110 

Thr Arg Gly Arg Val Trp Ala Lys He Pro Lvs Gly Leu Ser Fhe Glu 
115 120 125 

Gin Ala Ala Ala Leu Pro Ala Glv lie Ser Thr Ala Gly Leu Ala Met 
130 135 140 

Lys Leu Leu Gly Leu Pro Leu Pro Ser Pro Ser Ala Asp Gin Pro Pro 
145 150 155 160 

Thr ills Ser Lys Pro Vai Tyr Val Leu Val Tvr Gly Gly Ser Th- Ala 
165 170 " 175 

Thr Ala Thr Val Thr Met Gin Met Leu Ara Leu Ser Gly Tyr lie Pro 
180 185 ' 190 

He Ala Thr Cys Ser Pro His Asn Phe Asd Leu Ala Lys Ser Arg Gly 
195 200 ' 205 

Ala Glu Glu Val Phe Asp Tvr Arg Ala Pro Asn Leu Ala Gin Thr He 
210 215 220 

Arg Thr Tyr Thr Lys Asn Asn Leu Arg Tyr Ala Leu Asp Cys lie Thr 
225 230 235 240 

Asn Val Glu Ser Thr Thr Phe Cys Phe Ala Ala He Gly Arg Ala Gly 
245 250 255 

Gly His Tyr Val Ser Leu Asn Pro Phe Pro Glu His Ala Ala Thr Arg 
260 265 270 

Lys Met Val Thr Thr Asp Trp Thr Leu Gly Pro Thr He Phe Gly Glu 
275 280 285 

Gly Ser Thr Trp Pro Ala Pro Tyr Gly Arg Pro Gly Ser Glu Glu Glu 
290 295 300 

Arg Gin Phe Gly Glu Asp Leu Trp Arg lie Ala Gly Gin Leu Val Glu 
305 310 315 320 

Asp Gly Arg Leu Val His His Pro Leu Arg Val Val Gin Gly Gly Phe 
325 330 335 

Asp His He Lys Gin Gly Met Glu Leu Val Arg Lys Gly Glu Leu Ser 
340 345 350 

Gly Glu Lys Leu Val Val Arg Leu Glu Gly Pro 
355 360 

<210> 6 
<211> 413 
<212> PRT 

<213> Aspergillus terreus 
<400> 6 

Met Gly Ser He lie Asp Ala Ala Ala Ala Ala Asp Pro Val Vai Leu 
1 5 10 15 

Met Glu Thr Ala Phe Arg Lys Ala Val Lys Ser Arg Gin lie Pro Gly 
20 25 30 

Ala Val He Met Ala Arg Asp Cys Ser Gly Asn Leu Asn Tyr Thr Arg 
35 40 45 
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Cys Phe Gly Ala Arc Thr Val Arc Ara Asp Giu Cys A sr. Glv Le'' c rc 
50 ' 55 eo 

Pro Leu Gin Vei Asp Thr Pro Cys Arc Leu Ala Ser Ala Thr Lys Leu 
65 70 75 80 

Leu Thr Thr lie Met Ala Leu Gin Cys Me: Glu Arg Giy Leu Val Asp 
85 90 95 

Leu Asp Glu Thr Val Asp Arg Leu Leu Pro Asp Leu Ser Ala Met Pro 
100 105 110 

Val Leu Giu Glv Phe Aso Asp Ala Giy Asn Ala Aro Leu Arg Glu Arc 
115 120 12 5 

Arg Gly Lys lie Thr Leu Arg His Leu Leu Thr His Thr Ser Gly Leu 
130 135 140 

Ser Tyr Val Phe Leu His Pro Leu Leu. Arg Glu Tyr Met Ala Gin Gly 
145 150 155 160 

His Leu Gin Ser Ala Glu Lys Phe Giy lie Glx Ser Arg Leu Ala Pro 
165 170 175 

Pro Ala Val Asn Asp Pro Giy Ala Glu Trp lie Tyr Giy Ala Asn Leu 
180 165 190 

Asp Trp Ala Gly Lys Leu Val Giu Arg Ala Thr Glv Leu Asp Leu Glu 
195 200 " 205 

Gin Tyr Leu Gin Glu Asn lie Cys Ala Pro Leu Gly lie Thr Asp Met 
210 215 220 

Thr Phe Lys Leu Gin Gin- Arg Pro Asp Met Leu Ala Arg Arg Ala Asp 
225 230 235 240 

Gin Thr His Arg Asn Ser Ala Asp Gly Arg Leu Arg Tyr Asp Asp Ser 
245 250 255 

Val Tyr Phe Arg Ala Asp Gly Glu Glu Cys Phe Gly Gly Gin Gly Val 
260 265 270 

Phe Ser Gly Pro Gly Ser Tyr Met Lys Val Leu His Ser Leu Leu Lys 
275 280 285 

Arg Asp Gly Leu Leu Leu Gin Pro Gin Thr Val Asp Leu Met Phe Gin 
290 295 300 

Pro Ala Leu Glu Pro Arg Leu Glu Glu Gin Met Asn Gin His Met Asp 
305 310 315 320 

Ala Ser Pro His He Asn Tyr Gly Giy Pro Met Pro Met Val Leu Arg 
325 330 335 

Arg Ser Phe Gly Leu Gly Gly He He Ala Leu Glu Asd Leu Asp Gly 
340 345 * 350 

Glu Asn Trp Arg Arg Lys Giy Ser Leu Thr Phe Gly Giy Gly Pro Asn 
355 360 365 

He Val Trp Gin lie Asp Pro Lys Ala Gly Leu Cys Thr Leu Ala Phe 
370 375 380 

Phe Gin Leu Glu Pro Trp Asn Asd Pro Val Cys Arg Asd Leu Thr Arg 
385 390 395 * 400 
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Thr Phe Glu His Ala lie Tyr Ala Gin Tvr G 1 
405 4 "lC 



n Gin 



Gly 



<210> 7 
<211> 1068 
<212> PRT 

<213> Aspergillus terreus 
<400> 7 

Met Asp Pro Vai Val Arg Lys Pro Asp Pro Gly Gly Val Gin His Arg 
i 5 10 15 

Val Thr Lys Ala Leu Arg Ala He Val Gly His Ala Gys Arg His Pro 
20 25 30 

He His Thr Leu Leu Val Thr Ala Leu Thr Ala Ala Thr Thr His Leu 
35 40 45 

His Val Leu Glu Gly Thr Tvr Gin Ala Thr His Arg Glu Ala Ser Ala 
50 55 60 

Trp Lys Trp Gin He Asp Asp Arg Pro Lys Val Pro Glu Asp Gly Gin 
65 ^ 70 75 80 

Ser Asp Phe His Trp Ala Leu Val Thr Leu Asp Leu Pro Gly Ala Ser 
85 90 95 

Val Asp Ala Ser He Pro Phe Leu Ser Asn Thr Leu Ser Gly Phe Leu 
100 105 110 

Gly Ala Glu Gin Thr Thr Pro Thr Pro Asp Ser Ser Pro Ser Pro Asp 
115 120 125 

His Ser Ala Leu Thr Phe Arg Val Pro Tyr Ser Gin Leu Asp Gly Phe 
130 135 140 

Leu Gin Ala Val Glu He lie Pro Ser Glu Lys Glu Asp Asp Ser Trp 
145 150 155 160 

Arg Leu Arg Ser Pro Arg Glu Glu Gly Ser Pro Arg Ser Leu Gly His 
165 170 175 

Trp Leu Giy Ser Ser Trp Leu Ser Phe Leu His Arg Val His His Ala 
180 185 190 

Glu Thr Val Asp Leu Val He He Gly Leu Ser Tyr Leu Ala Met Asn 
195 200 205 

Met Thr Val Val Ser Leu Phe Arg Val Met Arg His Leu Gly Ser Arq 
210 215 220 

Phe Trp Leu Ala Ala Ser Val Leu Leu Ser Gly Ala Phe Ala Phe Val 
225 230 235 240 

Leu Gly Leu Gly He Thr Thr Thr Cys Asp Val Pro Val Asp Met Leu 
245 250 255 

Leu Leu Phe Glu Gly He Pro Tyr Leu Val Leu Thr Val Gly Phe Glu 
260 265 270 

Lys Pro He Gin Leu Thr Arg Ala Val Leu Cys Val Ser Glu Glu Leu 
275 280 285 

Trp Gly Gly Gly. Gin Arg Gin Val Pro Asn Gly Ala Ser Ser Asp Asp 



290 



295 



300 
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Ser Arg Gin Asn Gin Leu lie Pro Asn lie lie Gin Leu Ala Vai Asp 
305 310 315 320 

Arg Glu Giy Trp Tyr He Val Arc Ser Tvr Leu Leu Giu He Giy Ala 
325 330 335 

Leu Ala Leu Giy Ala Vai Leu Arc Pro Lys Asd Ser Leu Giy His Phe 
340 345' ' 350 

Cys Phe Leu Ala Ala Tro Thr Leu Leu He Asp Ala Vai Leu Leu Phe 
355 360 365 

Thr Phe Tyr Ala Thr He Leu Cys Val Lys Leu Glu He Thr Ara lie 
370 375 380 

Arg Ser Pro Giy Giy Leu Giy Gin Val Asn Ala Lys His Pro Ser Giy 
385 390 395 400 

lie Phe Giy His Lys Vai Lys Ser Thr Asn lie Thr Trp Trp Lys Leu 
405 410 415 

Leu Thr Val Giy Giy Phe Val Leu Cys His Phe Leu Gin Leu Ser Pro 
420 425 430 

Phe Phe Tyr Arg Val Met Giy Glu Tyr Met Ala Asn Giy Thr Leu Pro 
435 440 445 

Pro Thr Ala Val Ser Pro Phe Lys Glu Ala Ala Asn Giy Leu Asn Glu 
450 455 460 

He Tyr Leu Thr Ala Arg Val Glu Giy Phe Glu Thr Arg Val Thr Val 
465 470 475 480 

Leu Pro Pro Leu Gin Tyr Val Leu Glu Ser Ala Giy Phe Asn lie Ser 
485 490 495 

Ala Thr Lys Arg Ser Thr Phe Asp Giy Val Leu Asp Giy Leu Glu Ser 
500 505 510 

Pro Leu Giy Arg Leu Cys Leu Met Giy Ala Leu Val Val Ser Leu Val 
515 520 525 

Leu Asn Asn His Leu lie His Ala Ala Arg Trp His Ala Trp Pro Gin 
530 535 540 

Ala Arg Glu Ser Ala Val Pro Asp Giy Ser Tyr Leu Ser Val Pro Cys 
545 550 555 560 

Ser Ala Thr Ala Pro Glu Val Cys Thr Arg Pro Pro Glu Glu Thr Glu 
565 570 575 

Ala Leu Leu Lys Ser Asn Gin Ala Glu Ser Leu Thr Asp Asp Glu Leu 
580 585 590 

Val Glu Leu Cys Leu Arg Giy Lys lie Ala Giy Tyr Ser Leu Glu Lys 
595 600 605 

Thr Leu Glu Arg lie Ala Ala Giy Ser Ser Arg Ser Val Thr Arg Leu 
610 615 620 

Glu Ala Phe Thr Arg Ala Val Arg lie Arg Arg Ala Ala Val Ser Lys 
625 630 635 640 

Thr Pro Ser Thr Gin Asn Leu Cys Ser Giy Leu Ala Glu Ser Leu Leu 
645 650 655 
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Pro Tvr Arg Asd Tvr Asn Tvr Glu Leu Val His Giy Ala Cvs Cys GIu 
660 " 665 670 

Asn Val Val Gly Tvr Leu Pro Leu Pro Leu Gly Val Ala Gly Pro Met: 
675 680 685 

Val He Asp Giy Gin Ala Leu Phe He Pro Met Ala Thr Thr GIu Gly 
690 695 700 

Val Leu Val Ala Ser Ala Ser Arg Giy Cys Lys Ala He Asn Ala Gly 
~j5 710 715 720 

Sly Glv Ala Thr Thr Met Leu Lys Glv Asd Gly Met Thr Arq Glv Pre 
725 " 730 " " "25 

Cys Leu Arg Phe Pro Ser Ala Gin Arg Ala Ala GIu Ala Gin Arg Tro 
740 745 750 

Val GIu Ser Pro Leu Gly His Glu Val Leu Ala Ala Ala Phe Asn Ala 
755 760 765 

Thr Ser Arg Phe Ala Arg Leu Gin Thr Leu Thr Val Ala Gin Ala Glv 
770 775 780 

lie Tvr Leu Tyr He Arg Phe Arg Thr Thr Thr Gly Asp Ala Met Gly 
755 " 790 795 800 

Met Asn Met lie Ser Lys Gly Val Glu Lys Ala Leu Glu Ala Met Ala 
805 810 815 

Ala Glu Gly Gly Phe Pro Asp Met His Thr Val Thr Leu Ser Gly Asn 
820 825 830 

Phe Cys Ser Asp Lys Lys Ser Ala Ala He Asn Trp He Gly Giy Arg 
335 " 84C " 845 

Glv Lvs Ser Val He Ala GIu Ala Thr He Pro Ala Glu Thr Val Ara 
850 855 860 

Gin Val Leu Lys Thr Asp Val Asp Ala Leu Val Glu Leu Asn Thr Ala 
865 870 875 880 

Lys Asn Leu Val Gly Ser Ala Met Ala Gly Ser Leu Gly Gly Phe Asn 
885 890 895 

A.Ia His Ala Ser Asn Leu Val Gin Ala Val Phe Leu Ala Thr Gly Gin 
900 905 910 

Asc Pro Ala Gin Asn Val Glu Ser Ser Ser Cys He Thr Thr Met Lys 
915 920 925 

Asn He Asp Gly Asn Leu His He Ala Val Ser Met Pro Ser Met Glu 
'930 935 940 

Val Gly Thr He Gly Gly Gly Thr He Leu Glu Ala Gin Gly Ala Met 
945 950 955 960 

Leu Asp Leu Leu Gly Val Arc Glv Ala His Ser Thr Glu Pro Gly Ala 
965 970 975 

Asn Ala Ara Ara Leu Ala Arg He Val Ala Ala Ala Val Leu Ala Glv 
980 985 990 

Giu Leu Ser Thr Cvs Ala Ala Leu Ala Ala Gly His Leu Val Asn Ala 
995 1000 1005 
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His Met Gin His Asn Ara Thr Ser Lys Aso Ala lie Ser Giy Thr Glu 
1010 "l015 " 1020 

Tvr Gly Ala lie Arc Thr Fro Val Tvr Val Val lie Leu Giu His Ala 
1025 1030 ' - 1035 1040 

Giy Asp lie His Phe Val Gin lie Glu Tyr Lys Asn Thr Tyr Leu Ara 
1045 1050 1055 

Arg Lys Val Pro Thr Leu Ser Cys Asn Leu Gly Arg 
1060 1065 

<210> 8 
<211> 503 
<212> PRT 

<213> Aspergillus rerreus 

<400> 8 - 

Met Ala Ala Asp Gin Gly lie Phe Thr Asn Ser Val Thr Leu Ser Pro 
15 10 15 

Val Glu Gly Ser Arg Thr Gly Gly Thr Leu Pro Arg Arg Ala Phe Arg 
20 25 30 

Arg Ser Cys Asp Arg Cys His Ala Gin. Lys lie Lys Cys Thr Gly Asn 
35 40 45 

Lys Glu Val Thr Gly Arg Ala Pro Cys Gin Arg Cys Gin Gin Ala Gly 
50 55 60 

Leu Arg Cys Val Tyr Ser Glu Arg Cys Pro Lys Arg Lys Leu Arg Gin 
65 70 75 80 

Ser Arg Ala Ala Asd Leu Val Ser Ala Asp Pro Asp Pro Cys Leu His 
85 90 95 

Met Ser Ser Pro Pro Val Pro Ser Gin Ser Leu Pro Leu Asp Val' Ser 
100 105 110 

Glu Ser His Ser Ser Asn Thr Ser Arg Gin Phe Leu Asp Pro Pro Asp 
115 120 125 

Ser Tyr Asp Trp Ser Trp Thr Ser lie Gly Thr Asp Glu Ala lie Asp 
130 135 140 

Thr Asp Cys Trp Gly Leu Ser Gin Cys Asp Gly Gly Phe Ser Cys Gin 
145 150 155 160 

Leu Glu Pro Thr Leu Pro Asp Leu Pro Ser Pro Phe Glu Ser Thr Val 
165 170 175 

Glu Lys Ala Pro Leu Pro Pro Val Ser Ser Asp lie Ala Arg Ala Ala 
180 185 190 

Ser Ala Gin Arg Glu Leu Phe Asp Asp Leu Ser Ala Val Ser Gin Glu 
195 200 205 

Leu Glu Glu lie Leu Leu Ala Val Thr Val Glu Trp Pro Lys Gin Glu 
210 215 220 

lie Trp Thr Arg Ala Ser Pro His Ser Pro Thr Ala Ser Arg Glu Arg 
225 230 235 240 

lie Ala Gin Arg Arg Gin Asn Val Trp Ala Asn Trp Leu Thr Asp Leu 
245 250 255 
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His Met Fhe Ser Leu Asp Pro lie Gly Met Phe Phe Asn Ala Ser Arg 
260 265 270 

Arc Leu Leu Thr Val Leu Arg Gin Gin Ala Gin Ala Asp Cys His Gin 
275 280 285 

Gly Thr Leu Asp Glu Cys Leu Ara Thr Lvs Asn Leu Phe Thr Ala Val 
290 295 " 300 

His Cys Tyr lie Leu Asn. Val Arg lie Leu Thr Ala lie Ser Glu Leu 
305 310 315 320 

Leu Leu Ser Gin lie Arg Arg Thr Gin Asn Ser His Met Ser Pro Leu 
325 330 335 

Glu Gly Ser Arg Ser Gin Ser Pro Ser Arc Asp Asp Thr Ser Ser Ser 
340 345 350 

Ser Gly His Ser Ser Val Asp Thr lie Pro Phe Phe Ser Glu Asn Leu 
355 360 - 365 

Pro lie Gly Glu Leu Phe Ser Tyr Val Asp Pro Leu Thr His Ala Leu 
370 375 380 

Phe Ser Ala Cys Thr Thr Leu His Val Gly Val Gin Leu Leu Arg Glu 
385 390 395 400 

Asn Glu lie Thr Leu Gly Val His Ser Ala Gin Gly lie Ala Ala Ser 
405 410 415 

lie Ser Met Ser Gly Glu Pro Gly Glu Asp He Ala Arg Thr Gly Ala 
420 425 430 

Thr Asn Ser Ala Arg Cys- Glu Glu Gin Pro Thr Thr Pro Ala Ala Arg 
435 440 445 

Val Leu Phe Met Phe Leu Ser Asp Glu Gly Ala Phe Gin Glu Ala Lys 
450 455 460 

Ser Ala Gly Ser Arg Gly Arg Thr He Ala Ala Leu Arg Arg Cys Tyr 
465 470 475 480 . 

Glu Asp lie Phe Ser Leu Ala Arg Lys His Lys His Gly Met Leu Arg 
485 490 495 

Asp Leu Asn Asn He Pro Pro 
500 

<210> 9 
<211> 542 
<212> PRT 

<213> Aspergillus terreus 
<400> 9 

Met Thr Ser His His Gly Glu Thr Glu Lys Pro Gin Ser Asn Thr Ala 
15 10 15 

Gin Met Gin He Asn His Val Thr Gly Leu Arg Leu Gly Leu Val Val 
20 25 30 

Val Ser Val Thr Leu Val Ala Phe Leu Met Leu Leu Asp Met Ser lie 
35 40 45 

He Val Thr Ala lie Pro His lie Thr Ala Gin Phe His Ser Leu Gly 
50 55 60 
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Asp Val Giy Trp Tyr Giy Ser Ala Tyr Leu Leu Ser Ser Cys Ala Leu 
65 70 75 80 

Gin Pro Leu Ala Giy Lys Leu Tyr Thr Leu Leu Thr Leu Lys Tyr Thr 
85 90 95 

Phe Leu Ala Phe Leu Giy Leu Phe Glu lie Giv Ser Val Leu Cvs Giy 
100 105 " 110 

Thr Ala Arg Ser Ser Thr Met Leu lie Val Giy Arg Ala Val Ala Giy 
H5 120 125 

Met Giy Giy Ser Giy Leu Thr Asn Giy Ala lie Thr lie Leu Ser Ala 
130 135 140 

Ala Ala Pro Lys Gin Gin Gin Pro Leu Leu lie Giy lie Met Met Giy 
145 150 155 160 

Leu Ser Gin He Ala He Val Cys Giy. Pro Leu Leu Giy Giy Ala Phe 
165 170 175 

Thr Gin His Ala Ser Trp Arg Trp Cys Phe Tyr He Asn Leu Pro He 
180 185 190 

Giy Ala Phe Ala Thr Phe Leu Leu Leu Val He Gin He Pro Asn Arg 
195 200 205 

Leu Pro Ser Thr Ser Asp Ser Thr Thr Asp Giy Thr Asn Pro Lys Arq 
210 215 220 

Arg Giy Ala Arg Asp Val Leu Thr Gin Leu Asp Phe Leu Giy Phe Val 
225 230 235 240 

Leu Phe Ala Giy Phe Ala He Met lie Ser Leu Ala Leu Glu Trp Giy 
245 250 255 

Giy Ser Asp Tyr Ala Trp Asn Ser Ser Val He He Giy Leu Phe Cys 
260 265 270 

Ala Ala Giy Val Ser Leu Val Leu Phe Giy Cys Trp Glu Arg His Val 
275 280 285 

Giy Giy Ala Val Ala Met He Pro He Ser Val Ala Ser Arg Ara Gin 
290 295 300 

Val Trp Cys Ser Cys Phe Phe Leu Giy Phe Phe Ser Giy Ala Leu Leu 
305 310 315 320 

lie Phe Ser Tyr Tyr Leu Pro lie Tyr Phe Gin Ala Val Lys Asn Val 
325 330 335 

Ser Pro Thr Met Ser Giy Val Tyr Met Leu Pro Giy lie Giy Giy Gin 
340 345 350 

lie Val Met Ala lie Val Thr Giy Ala lie lie Giy Lys Thr Giy Tyr 
355 360 365 

Tyr Val Pro Trp Ala Leu Ala Serdy lie Leu Val Ser He Ser Ala 
370 375 380 

Giy Leu Val Ser Thr Phe Gin Pro Glu Thr Ser lie Ala Ala Trp Val 
385 390 395 400 

Met Tyr Gin Phe Leu Giy Giy Val Giy Arg Giy Cys Giy Met Gin Thr 
405 410 415 
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Pre Vci Vai Ala lie Glr. Asn Ala Leu Pre Pre G2r. Thr Ser Pre lie 
420 425 430 

Gly lie Ser Leu Ala Met Phe Gly Gin Thr Phe Gly Gly Ser Leu Phe 
435 440 445 

Leu Thr Leu Thr Giu Leu Vai Phe Ser Asn Gly Leu Asd Ser Gly Leu 
450 455 460 

Arg Gin Tyr Ala Pre Thr Leu Asn Ala Gin Glu Vai Thr Ala Ala Gly 
465 470 475 480 

Ala Thr Gly Phe Arg Gin Vai Vai Pro Ala Pro Leu lie Ser Arg Vai 
485 490 495 

Leu Leu Ala Tyr Ser Lys Gly Vai Asp His Ala Phe Tyr Vai Ala Vai 
500 505 510 

Gly Ala Ser Gly Ala Thr Phe lie Phe. Ala Trp Gly Met Gly Arg Leu 
515 520 525 

Ala Trp Arg Gly Trp Arg Met Gin Glu Lys Gly Arg Ser Glu 
530 535 540 

<210> 10 
<211> 2532 
<212> PRT 

<213> Aspergillus terreus 
<400> 10 

Met Thr Pro Leu Asp Ala Pro Gly Ala Pro Ala Pro lie Ala Met Vai 
15 10 15 

Gly Met Gly Cys Arg Phe Gly Gly Gly Ala Thr Asp Pro Gin Lys Leu 
20 25 30 

Trp Lys Leu Leu Glu Glu Gly Gly Ser Ala Trp Ser Lys lie Pro Pro 
35 40 45 

Ser Arg Phe Asn Vai Gly Gly Vai Tyr His Pro Asn Gly Gin Arg Vai 
50 55 60 

Gly Ser Met His Vai Arg Gly Gly His Phe Leu Asp Glu Asp Pro Ala 
65 70 75 80 

Leu Phe Asp Ala Ser Phe Phe Asn Met Ser Thr Glu Vai Ala Ser Cys 
85 90 95 

Met Asp Pro Gin Tyr Arg Leu lie Leu Glu Vai Vai Tyr Glu Ala Leu 
100 105 110 

Glu Ala' Ala Gly He Pro Leu Glu Gin Vai Ser Gly Ser Lys Thr Gly 
115 120 125 

Vai Phe Ala Gly Thr Met Tyr His Asp Tyr Gin Gly Ser Phe Gin Arg 
130 135 140 

Gin Pro Glu Ala Leu Pro Arg Tyr Phe He Thr Gly Asn Ala Gly Thr 
145 150 155 160 

Met Leu Ala Asn Arg Vai Ser His Phe Tvr Asp Leu Arg Gly Pro Ser 
165 170 175 

Vai Ser He Asp Thr Ala Cys Ser Thr Thr Leu Thr Ala Leu His Leu 
180 185 190 
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Ala He Gin Ser Leu Arc Ala Glv Glu Ser Aso Met: Ala lie Val Ala 
195 200 ' 205 

Gly Ala Asn Leu Leu Leu Asn Pro Asp Val ?he Thr Thr Met Ser £sn 
210 215 220 

Leu Gly Phe Leu Ser Ser Asp Gly lie Ser Tvr Ser Pne £sp Se- Ara 
225 230 235 " 240 

Ala Asp Gly Tyr Gly Arg Gly Glu Gly Val Ala Ala lie Val Leu Lys 
245 250 255 

Thr Leu Pro Asp Ala Val Arg Asp Gly Asd Pro He Arg Leu He Val 
260 265 270 

Arg Glu Thr Ala He Asn Gin Asp Gly Arg Thr Pro Ala He Ser Thr 
275 280 285 

Pro Ser Gly Glu Ala Gin Glu Cys Leu He Gin Asp Cys Tyr Gin Lys 
290 295 300 

Ala Gin Leu Asp Pro Lys Gin Thr Ser Tyr Val Glu Ala His Gly Thr 
305 310 315 320 

Gly Thr Arg Ala Gly Asp Pro Leu Glu Leu Ala Val He Ser Ala Ala 
325 330 335 

Phe Pro Gly Gin Gin He Gin Val Gly Ser Val Lys Ala Asn He Glv 
340 345 350 

His Thr Glu Ala Val Ser Gly Leu Ala Ser Leu He Lys Val Ala Leu 
355 360 365 

Ala Val Glu Lys Gly Val He Pro Pre Asn Ala Arg Phe Leu Gin Pro 
370 375 380 

Ser Lys Lys Leu Leu Lys Asp Thr His He Gin He Pro Leu Cys Ser 
385 390 395 400 

Gin Ser Trp lie Pro Thr Asp Gly Val Arg Arg Ala Ser He Asn Asn 
405 410 415 

Phe Gly Phe Gly Gly Ala Asn Ala His Ala He Val Glu Gin Tyr Glv 
420 425 430 

Pro Phe Ala Glu Thr Ser lie Cys Pro Pro Asn Gly Tvr Ser Gly Asn 
435 440 445 

Tyr Asp Gly Asn Leu Gly Thr Asp Gin Ala His lie Tyr Val Leu Ser 
450 455 460 

Ala Lys Asp Glu Asn Ser Cys Met Arg Met Val Ser Arg Leu Cys Asd 
^65 470 475 480 

Tyr Ala Thr His Ala Arg Pro Ala Asp Asp Leu Gin Leu Leu Ala Asn 
485 490 495 

lie Ala Tyr Thr Leu Gly Ser Arg Arg Ser Asn Phe Arg Tro Lys Ala 
500 505 510 " 

Val Cys Thr Ala His Ser Leu Thr Gly Leu Ala Gin Asn Leu Ala Gly 
515 520 525 

Glu Gly Met Arg Pro Ser Lys Ser Ala Asp Gin Val Arg Leu Gly Trp 
530 535 540 
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Val Phe Thr Gly Gin Glv Ala Gin Trp Phe Aia Met Giv Arc Glu Leu 
545 550 555 " ' 560 

lie Glu Met Tyr Pre Val Phe Lvs Glu Ala Leu Leu Glu Cys Asc Gly 
565 570 575 

Tyr lie Lys Glu Met Gly Ser Thr Trp Ser lie lie Glu Glu Leu Ser 
580 585 590 

Ara Pro Glu Thr Glu Ser Arg Val Asp Gin Aia Glu Phe Ser Leu Pro 
595 600 605 

Leu Ser Thr Ala Leu Gin lie Ala Leu Val Arg Leu Leu Trp Ser Trp 
610 615 620 

Asn lie Gin Pro Val Ala Val Thr Ser His Ser Ser Gly Glu Ala Aia 
625 630 635 640 

Ala Ala Tyr Ala lie Gly Ala Leu Thr Ala Arg Ser Ala lie Gly lie 
645 650 655 

Ser Tyr lie Arg Gly Ala Leu Thr Ala Arg Asp Arg Leu Ala Ser Val 
660 665 670 

His Lys Gly Gly Met Leu Ala Val Gly Leu Ser Arg Ser Glu Val Gly 
675 680 685 

lie Tyr He Arg Gin Val Pro Leu Gin Ser Glu Glu Cys Leu Vai Val 
690 695 700 

Gly Cys Val Asn Ser Pro Ser Ser Val Thr Val Ser Gly Asp Leu Ser 
705 710 715 720 

Ala He Aia Lys Leu Glu Glu Leu Leu His Ala Asp Arg He Phe Ala 
725 730 735 

Arg Arg Leu Lys Val Thr Gin Ala Phe His Ser Ser His Met Asn Ser 
740 745 750 

Met Thr Asp Ala Phe Arg Ala Gly Leu Thr Glu Leu Phe Gly Ala Asp 
755 760 765 

Pro Ser Asp Ala Ala Asn Ala Ser Lys Asp Val He Tyr Ala Ser Pro 
770 775 780 

Arg Thr Gly Ala Arg Leu His Asp Met Asn Arg Leu Arg Asp Pro He 
785 790 795 800 

His Trp Val Glu Cys Met Leu His Pro Val Glu Phe Glu Ser Ala Phe 
805 810 815 

Arg Arg Met Cys Leu Asp Glu Asn Asp His Met Pro Lys Val Asp Arg 
820 825 830 

Val He Glu He Gly Pro His Gly Ala Leu Gly Gly Pro He Lys Gin 
835 840 845 

He Met Gin Leu Pro Glu Leu Ala Thr Cys Asp He Pro Tyr Leu Ser 
850 855 860 

Cys Leu Ser Arg Gly Lys Ser Ser Leu Ser Thr Leu Arg Leu Leu Ala 
865 870 875 880 

Ser Glu Leu He Arg Aia Gly Phe Pro Val Asp Leu Asn Ala lie Asn 
885 890 895 
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Phe Pro Arg Giy Cvs Glu Ala Ala Arc Val Gin Val Leu Ser Asp Leu 
900 905 910 

Pro Pro Tyr Pro Trp Asn His Glu Thr Arc Tyr Tro Lys Glu Pro Ara 
915 920 ' * 925 

He Ser Gin Ser Ala Arg Gin Arc Lys Gly Pro Val His Asp Leu He 
930 935 ' 940 

Giy Leu Gin Glu Pro Leu Asn Leu Pro Leu Ala Arg Ser Trp His Asn 
945 950 955 960 

Val Leu Arg Val Ser Asp Leu Pro Trp Leu Arg Asp His Val Val Gly 
965 970 975 

Ser His He Val Phe Pro Giy Ala Gly Phe Val Cys Met Ala Val Met 
980 985 990 

Gly He Ser Thr Leu Cys Ser Ser Asp His Glu Ser Asp Asp He Ser 
995 1000 1005 

Tyr He Leu Arg Asp Val Asn Phe Ala Gin Ala Leu He Leu Pro Ala 
1010 1015 1020 

Asp Gly Glu Glu Gly lie Asp Leu Arg Leu Thr lie Cys Ala Pro Asp 
1025 1030 1035 1040 

Gin Ser Leu Gly Ser Gin Asp Trp Gin Arg Phe Leu Val His Ser lie 
1045 1050 1055 

Thr Ala Asp Lys Asn Asp Trp Thr Glu His Cys Thr Gly Leu Val Arg 
1060 1065 1070 

Ala Glu Met Asp Gin Pro Pro Ser Ser Leu Ser Asn Gin Gin Arg lie 
1075 1080 1085 

Asp Pro Arg Pro Trp Ser Arg Lys Thr Ala Pro Gin Glu Leu Trp Asp 
1090- 1095 1100 

Ser Leu His Arg Val Gly lie Arg His Gly Pro Phe Phe Arg Asn He 
1105 1110 1115 1120 

Thr Cys lie Glu Ser Asp Gly Arg Gly Ser Trp Cys Thr Phe Ala lie 
1125 1130 1135 

Ala Asp Thr Ala Ser Ala Met Pro His Ala Tyr Glu Ser Gin His lie 
1140 1145 1150 

Val His Pro Thr Thr Leu Asp Ser Ala Val Gin Ala Ala Tyr Thr Thr 
1155 1160 1165 

Leu Pro Phe Ala Gly Ser Arg lie Lys Ser Ala Met Val Pro Ala Arg 
1170 1175 1180 

Val Gly Cys Met Lys lie Ser Ser Arg Leu Ala Asp Leu Glu Ala Arg 
1185 1190 1195 1200 

Asp Met Leu Arg Ala Gin Ala Lys "Met His Ser Gin Ser Pro Ser Ala 
1205 1210 1215 

Leu Val Thr Asp Val Ala Val Phe Asp Glu Ala Asp Pro Val Gly Gly 
1220 1225 1230 

Pro Val Met Glu Leu Glu Gly Leu Val Phe Gin Ser Leu Gly Ala Ser 
1235 1240 1245 
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Leu Glv Thr Ser Asd Ara Asp Ser Thr Asp Pre Gly Asn Thr Cys Ser 
1250 '1255 1260 

Ser Trp His Trp Ala Fro Asp He Ser Leu Val Asn Pro Gly Trc Leu 
1265 1270 1275 * 1280 

Glu Lys Thr Leu Glv Thr Gly He Gin Glu His Glu lie Ser Leu He 
1285 1290 1295 

Leu Glu Leu Arg Arg Cys Ser Val His Phe He Gin Glu Ala Met Glu 
1300 1305 1310 

Ser Leu Ser Val Gly Asd Val Glu Arg Leu Ser Gly His Leu Ala Lys 
1315 1320 1325 

Phe Tyr Ala Trp Met Gin Lys Gin Leu Ala Cys Ala Gin Asn Gly Glu 
1330 1335 1340 

Leu Gly Pro Glu Ser Ser Ser Trc Thr Arg Asp Ser Glu Gin Ala Arg 
1345 1350 1355 1360 

Cys Ser Leu Arg Ser Arg Val Val Ala Gly Ser Thr Asn Gly Glu Met 
1365 1370 1375 

He Cys Arg Leu Gly Ser Val Leu Pro Ala lie Leu Arq Arg Glu Val 
1380 1385 ~1390 

Asp Pro Leu Glu Val Met Met Asp Gly His Leu Leu Ser Arg Tyr Tyr 
1395 1400 1405 

Val Asp Ala Leu Lys Trp Ser Arg Ser Asn Ala Gin Ala Ser Glu Leu 
1410 1415 1420 

Val Arg Leu Cys Cys His Lys Asn Pro Arg Ala Arg He Leu Glu He 
1425 1430 1435 1440 

Gly Gly Gly Thr Gly Gly Cys Thr Gin Leu Val Val Asp Ser Leu Gly 
1445 1450 1455 

Pro Asn Pro Pro Val Gly Arg Tyr Asp Phe Thr Asp Val Ser Ala Gly 
1460 1465 1470 

Phe Phe Glu Ala Ala Arg Lys Arg Phe Ala Gly Trp Gin Asn Val Met 
1475 1480 1485 

Asp Phe Arg Lys Leu Asp lie Glu Asp Asp Pro Glu Ala Gin Gly Phe 
1490 1495 1500 

Val Cys Gly Ser Tyr Asp Val Val Leu Ala Cys Gin Val Leu His Ala 
1505 1510 1515 1520 

Thr Ser Asn Met Gin Arg Thr Leu Thr Asn Val Arg Lys Leu Leu Lys 
1525 1530 1535 

Pro Gly Gly Lys Leu lie Leu Val Glu Thr Thr Arg Asp Glu Leu Asp 
1540 1545 1550 

Leu Phe Phe Thr Phe Gly Leu Leu Pro Gly Trp Trp Leu Ser Glu Glu 
1555 1560 1565 

Pro Glu Arg Gin Ser Thr Pro Ser Leu Ser Pro Thr Met Trp Arg Ser 
1570 1575 1580 

Met Leu His Thr Thr Gly Phe Asn Gly Val Glu Val Glu Ala Arg Asp 
1^85 1590 1595 1600 
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Cys Asp ser Hx, Glu Phe Tyr Met He Ser Thr Met Met Ser Thr Ala 

1 biO 1615 

Val Gin Ala Thr Pro Met Sp r r\/<: q^v w-o ~ 

1620 1625 1 " yS Pro ,GIu Val Leu 



1630 



Leu Val Tyr Val Asp Ser Ser Thr Pro Met Ser Trp lie Ser Asp Leu 

1d4u 1645 

Gln^ly Glu He Arg Gly Arg Asn Cys Ser Val Thr Ser Leu Gin Ala 

1DOD 1660 

ll65 Ar9 Pr ° 16 P 7 r S Thr G1U G ^ Gin "e Cys Val Phe Leu Gly 

1675 1680 

Glu Val Glu His Ser Met Leu Gly Ser Val Thr Asn Asp Aso Phe Thr 

1690 1695 

Leu Leu Thr Ser Met Leu Gin Leu Al, Gly Gly Thr Leu Trp Val Thr 

1705 1?10 

Gin Gly Ala Thr Met Lys Ser Asp Asp Pro Leu Lys Ala Leu His Leu 

1/20 1725 

Gly^eu Leu Arg Thr Met Arg Asn Glu Ser His Gly Lys Arg Phe Val 

-L/jo 1740 

Se^Leu Asp Leu top Pro Ser Arg Asn Pro Trp Thr Gly As P Ser Arg 

1755 1760 
Asp Ala He val Ser Val Leu Asp Leu lie Ser „ et Ser Asp Glu Lys 

1/70 1775 

Glu Phe Asp^yr Ala Glu Arg Asp^ly Val lie His Val Pro Arg Ala 



1790 



Phe Ser Asp Ser lie Asn Gly Gly Glu Glu Asp Gly Tyr Ala Leu Glu 

1800 1805 

Pro Phe Gin Asp Ser Gin His Leu Leu Arg Leu Asp lie Gin Thr Pro 

iy15 1820 
Gly_Leu Leu Asp ser Leu His Phe Thr Lys Arg Asn Val Asp Thr Tyr 

1835 1840 
Glu Pro Asp Lys L eu Pro Asp Asp Trp Val Glu lie Glu Pro Arg Ala 

1S50 1855 

Phe Gly Leu Asn Phe Arg Asp He Met Val Ala Met Gly Gin Leu Glu 

1865 187Q 

Ser Asn Val Met Gly Phe Glu Cys Ala Gly Val Val Thr Ser Leu Ser 

1080 1885 

G1U 1890 Pr ° Leu ^ Val Gly Asp Arg Val 

iay5 1900 

CysAla Leu Met Asn Gly His Trp Ala Ser Arg Val Thr Thr Ser Arg 

1915 1920 
Thr Asn Val Val Arg He Pro Glu Thr Leu Ser Phe Pro His Ala Ala 

" 193( j 1935 

Ser lie Pro Leu Ala Phe Thr Thr Ala Tyr He Ser Leu Tyr Thr Val 

iy4i> 1950 
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Ala Arg lie Leu Pro Glv Glu Thr Val Leu lie His Ala Gly Ala Giy 
1955 " 1960 1965 

Gly Val Gly Gin Ala Ala lie lie Leu Ala Gin Leu Thr Gly Ala Glu 
1970 1975 1980 

Val Phe Thr Thr Ala Giy Ser Glu Thr Lys Arg Asn Leu Leu lie Aso 
1985 1990 1995 2000 

Lys Phe His Leu Asp Pro Asp His Val Phe Ser Ser Arg Asp Ser Ser 
2005 2010 2015 

Phe Val Asp Gly lie Lys Thr Arg Thr Ara Giy Lys Gly Val Asp Val 
2020 2025 2030 

Vai Leu Asn Ser Leu Ala Gly Pro Leu Leu Gin Lvs Ser Phe Asp Cys 
2035 2040 2045 

Leu Ala Arg Phe Gly Arg Phe Val Glu lie Gly Lys Lys Asp Leu Glu 
2050 2055 2060 

Gin Asn Ser Arg Leu Asp Met Ser Thr Phe Val Arg Asn Val Ser Phe 
2065 2070 2075 2080 

Ser Ser Val Asp He Leu Tyr Trp Gin Gin Ala Lys Pro Ala Glu He 
2085 2090 2095 

Phe Gin Ala Met Ser Glu Val He Leu Leu Trp Glu Arg Thr Ala He 
2100 2105 2110 

Gly Leu He His Pro lie Ser Glu Tyr Pro Met Ser Ala Leu Glu Lys 
2115 2120 2125 

Ala Phe Arg Thr Met Gin Ser Gly Gin His Val Gly Lys He Val Val 
2130 2135 2140 

Thr Val Ala Pro Asp Asp Ala Val Leu Val Arg Gin Glu Arg Met Pro 
2145 2150 2155 2160 

Leu Phe Leu Lys Pro Asn Val Ser Tyr Leu Val Ala Gly Gly Leu Gly 
2165 2170 2175 

Gly He Giy Arg Arg He Cys Glu Trp Leu Val Asp Arg Gly Ala Arg 
2180 2185 2190 

Tyr Leu He lie Leu Ser Arg Thr Ala Arg Val Asp Pro Val Val Thr 
2195 2200 2205 

Ser Leu Gin Glu Arg Gly Cys Thr Val Ser Val Gin Ala Cys Asp Val 
2210 2215 2220 

Ala Asp Glu Ser Gin Leu Glu Ala Ala Leu Gin Gin Cys Arg Ala Glu 
2225 2230 2235 2240 

Glu Met Pro Pro lie Arg Gly Val He Gin Gly Ala Met Val Leu Lys 
2245 2250 2255 

Asp Ala Leu Val Ser Gin Met Thr Ala Asp Gly Phe His Ala Ala Leu 
2260 2265 2270 

Arg Pro Lys Val Gin Gly Ser Trp Asn Leu His Arg lie Ala Ser Asp 
2275 2280 2285 

Val Asp Phe Phe Val Met Leu Ser Ser Leu Val Gly Val Met Gly Gly 
2290 2295 2300 
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Ala Giv Gin Ala Asn Tvr Ala Ala Ala Gly Ala Phe Gin Asp Ala Leu 
2305 " 231C 2315 2320 

Ala Glu His Arg Met Ala His Asn Gin Pro Ala Val Thr lie Asp Leu 
2325 2330 2335 

Gly Met Val Gin Ser lie Gly Tyr Val Ala Glu Thr Aso Ser Ala Val 
234C ' 2345 2350 

Ala Glu Arg Leu Gin Arg He Gly Tyr Gin Pro Leu His Glu Glu Glu 
2355 2360 2365 

Val Leu Asp Val Leu Glu Gin Ala He Ser Pro Val Cys Ser Pro Ala 
2370 2375 2380 

Ala Pro Thr Arg Pro Ala Val He Val Thr Gly He Asn Thr Arg Pro 
2385 2390 2395 2400 

Gly Pro His Trp Ala His Ala Asp Trp Met Gin Glu Ala Arg Phe Ala 
2405 2410 2415 

Gly He Lys Tyr Arg Asp Pro Leu Arg Asp Asn His Gly Ala Leu Ser 
2420 2425 2430 

Leu Thr Pro Ala Glu Asp Asp Asn Leu His Ala Arg Leu Asn Arg Ala 
2435 2440 2445 

He Ser Gin Gin Glu Ser He Ala Val He Met Glu Ala Met Ser Cys 
2450 2455 2460 

Lys Leu He Ser Met Phe Gly Leu Thr Asp Ser Glu Met Ser Ala Thr 
2465 2470 2475 2480 

Gin Thr Leu Ala Gly lie Gly Val Asp Ser Leu Val Ala He Glu Leu 
2485 2490 2495 

Arg Asn Trp lie Thr Ala Lys Phe Asn Val Asp He Ser Val Phe Glu 
2500 2505 2510 

Leu Met Glu Gly Arg Thr lie Ala Lys Val Ala Glu Val Val Leu Gin 
2515 2520 2525 

Arg Tyr Lys Ala 
2530 

<210> 11 
<211> 249 
<212> PRT 

<213> Aspergillus terreus 
<400> 11 

Met Ala Thr Gin Glu Phe Leu Ser Asp Val Ser Ser Gly Phe Leu Ser 
15 10 15 

Ala Glu Ala lie Ara Tyr Arg Val Lys Thr Gly Val Ser Met Asp Gly 
20 25 30 

Trp Met Lys Arg Gly Tyr Ser Cys Asn Ser Val Arg Thr Asp Asp Lys 
35 40 45 

His His Leu Arg His Leu Thr Asn lie Gly Leu Asp Thr Pro Pro Cys 
50 55 60 

Pro Lys Ser Leu Pro Ala Ala His Ser Ala Val Ala Ser Cys Leu Thr 
65 70 75 80 
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Phe Val Pre Fro Asp Pro Cvs Glu Asn Tro Glu Ale Leu Gin Vai Ala 
85 " 90 95 

Trp Asp Lys Ala Cys Cys Arg Asn Pro Thr Pro Leu Phe Phe He Cys 
100 ' 105 110 

Val Ser Leu Leu Phe Ser Phe Tvr Ser Leu Trp Leu Gin Arg Gly Gly 
115 120 125 

Cvs Gly Arg Tyr Giv Gly Leu His Arg Val Ser Lys Val Phe Pro Lys 
130 - 135 140 

Val Trp Pro Asp Aso Met Asp Ser Gin Leu Pro Ser Arg Leu Gin Thr 
145 150 155 160 

Leu Val Ser Lys Arg Lys Pro Glu Pro Ala Pro Asn Asn Ser Thr Tyr 
165 170 175 

He Ser Lys Gly Tyr Ala Thr Phe Phe Asn Gin Phe Ser Leu Pro Ser 
180 185 190 

Val Asp Val Thr Gin He Leu Asn Gin Thr Leu Gin His His Asp Val 
195 200 205 

Glu Thr He Asn Leu Asp Cys Gly Ser Gly Leu Leu Thr Leu Arg Thr 
210 215 220 

Gin Leu Arg lie Leu Leu He Gly Lys Pro Lys He He Lys Pro Phe 
225 230 235 240 

Ser Gly Leu Arg Thr Ser He Asn Glu 
245 

<210> 12 
<211> 742 
<212> PRT 

<213> Aspergillus terreus 
<400> 12 

Met Glu Ser Ala Glu Leu Ser Ser Lys Arg Gin Ala Phe Pro Ala Cys 
15 10 15 

Asp Glu Cys Arg lie Arg Lys Val Arg Cys Ser Lys Glu Gly Pro Lys 
20 25 30 

Cys Ser His Cys Leu Arg Tyr Asn Leu Pro Cys Glu Phe Ser Asn Lys 
35 40 45 

Val Ala Arg Asp Val Glu Lys Leu Gly Ser Arg Val Gly Asp He Glu 
50 55 60 

His Ala Leu Gin Ara Cys Leu Ser Phe He Asp Ala His Gin Gly Phe 
65 70 75 80 

Arg Asp Leu Ser Arg Pro Gin Ser Gin Glu Ser Gly Tyr Thr Ser Ser 
85 90 95 

Thr Ser Ser Glu Glu Cys Glu Val Asn Leu Tyr Ser Gly Lys His Thr 
100 105 110 

Ser Pro Thr Glu Glu Asc Gly Phe Trp Pro Leu His Gly Tyr Gly Ser 
115 * 120 125 

Phe Val Ser Leu Val Met Glu Ala Gin Ala Ala Asn Ala Asn Leu Thr 
130 135 140 
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Ser Trp Leu Pro Val Asp Met Thr Ser Glv Gin Val Ala Glu Met Val 
145 150 - 155 160 

Ala Phe Asp Arg Gin Ala Val Ser Ala Val Arg Ser Lys Val Ala Glu 
165 170 175 

Ala Asn Glu Thr Leu Gin Gin lie He Glu Asp He Pro Thr Leu Ser 
180 185 190 

Ala Ser Glu Asn Asp Thr Phe Leu Pro Ser Leu Pro Pro Arg Ala Leu 
195 200 205 

Val Glu Pro Ser He Asn Glu Tyr Phe Lys Lys Leu His Pro Arg Leu 
210 215 220 

Pro He Phe Ser Arg Gin Thr lie Met Asp Ala Val Glu Ser Gin Tyr 
225 230 235 240 

Thr lie Arg Thr Gly Pro Pro Asp Leu Val Trp lie Thr Ser Phe Asn 
245 . 250 255 

Cys He Val Leu Gin Ala Leu Thr Gin Thr Ser He Ala Asn Lys Val 
260 265 270 

Val Gly Cys Thr Gly Gin Asp He Pro He Asp Tyr Met He He Ser 
275 280 285 

Leu Leu Arg Asn lie Arg Gin Cys Tyr Asn Arg Leu Glu Thr Leu Val 
290 295 300 

Lys Pro Arg Leu Ser Asn lie Arg Ala Leu Phe Cys Leu Ala Leu Val 
305 310 315 320 

Ala Met Glu Tyr Phe Asp Phe Ala He Phe Leu Thr He Phe Ala Gin 
325 330 335 

Val Cys Glu Leu Ser Arg Leu lie Gly Leu His Leu Thr Thr Thr Thr 
340 345 350 

Pro Pro Thr Glu Asp Gly Ala Val Gly Asp Gin Pro Lys Asp Leu Phe 
355 360 365 

Trp Ser lie Phe Leu Val Asp Lys His Val Ser He lie Gly Gly Lys 
370 375 380 

Ala Cys Leu Leu Pro Ser Tyr Asp Cys Ser Val Pro Leu Pro Pro Tyr 
385 390 395 400 

Asp Ser Ala Ala Pro Leu Pro Asn Ala Phe Ala Ala Arg He Arg Leu 
405 410 415 

Ala Phe lie Leu Glu Glu lie Tyr Leu Gly Leu Tyr Ser Ala Lys Ser 
420 425 430 

Ser Lys Met Glu Gin Ser Arg Val Arg Arg Arg lie Arg Arg lie Ala 
435 440 445 

Arg Lys Leu Ser Gin Trp His Val Gin His Glu His Val Leu Arg Thr 
450 455 460 

Gly Asp Pro Asn Arg Pro Leu Glu Glu Tyr lie Cys Ala Thr Gin Leu 
465 470 475 480 

Arg Phe Ala Leu Ser Ser Cys Trp Val Leu Leu His Lys Arg He Trp 
485 490 495 
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Ser Gin Glu Arg Gly Ala Val Cys Leu Gin His Ala Arg Asp Cys Leu 
500 505 510 

Met Leu Phe Lys Gin Leu Cys Asp Gly Cys Lys Ser Gly Phe Ser Asn 
515 520 525 

Phe Asp Ser He Val Leu Asn Tyr Ser Leu He Ser Phe Met Gly He 
530 535 540 

Tyr Val His He Val Glu Glu Asp Gin Pro He His Ser Gin Asp Met 
545 550 555 560 

Glu lie Leu Thr Phe Phe Ala He Tyr Thr Asn Arg Ser Ala Ser Asn 
565 570 575 

Arg Ser Ser Ala Ser lie Ser Tyr Lys Leu Ser Gin Val Ala Ser Arg 
580 585 590 

Cys Ser Asp lie Ala Leu Leu Leu Gin- Asn Leu Arg Glu Arg Arg Phe 
595 600 605 

He Pro Thr Thr lie Ser Arg Ser Pro Thr Pro Ser Trp Asn Glu Pro 
610 615 620 

Thr Tyr Met Asp Tyr Asp Val Ala Asn Ala Ser Thr Ser Thr Thr Ser 
625 630 635 640 

Thr Gly Ser Ser Tyr Asn Leu Asn lie Ser Pro Leu Gly Val Pro Gly 
645 650 655 

Asp Gly Gin Val Trp Asp lie Tyr Phe Asn Pro Arg Glu He Pro Met 
660 665 670 

Asp Gly Thr He Ala Thr Pro Ser Glu Asp Ala Thr Gin Asp Leu Leu 
675 680 685 

Ser Asn Asp Ala Gly Gin Cys Leu Gly Phe Pro Asp Phe Ser Leu Gly 
690 695 700 

He Asp Asn Phe Ser Asp Phe Pro Leu Gly lie Asp Met Thr Ser Gin 
705 710 715 720 

Ser Glu Phe Gly Leu lie Met Glu Glu Asp lie lie Arg Tyr Glu Arg 
725 730 735 

Leu Leu Asp Arg Pro Val 
740 

<210> 13 
<211> 301 
<212> PRT 

<213> Aspergillus terreus 
<400> 13 

Met Glu Ser Lys Val Gin Thr Asn Val Pro Leu Pro Lys Ala Pro Leu 
15 10 15 

Thr Gin Lys Ala Arg Gly Lys Arg Thr Lys Gly lie Pro Ala Leu Val 
20 25 30 

Ala Gly Ala Cys Ala Gly Ala Val Glu He Ser lie Thr Tyr Pro Phe 
35 40 45 

Glu Ser Ala Lys Thr Arg Ala Gin Leu Lys Arg Arg Asn His Asp Val 
50 55 60 
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Ala Ala lie Lvs Pro Gly lie Arc Glv Trc Tyr Aia Gly Tvr Giv Ala 
65 - 70 75 80 

Thr Leu Val Gly Thr Thr Leu Lys Ala Ser Val Gin Phe Ala Ser Phe 
85 90 95 

Asp. lie Tyr Arg Ser Ala Leu Ser Gly Pro Asn Gly Glu Leu Ser Thr 
100 105 110 

Glv Ala Ser Val Leu Aia Gly Phe Gly Aia Gly Vai Thr Glu Aia Va i 
115 120 125 

Leu Ala Val Thr Pro Ala Glu Ala He Lys Thr Lys He He Asd Ala 
130 135 140 

Arg Lys Val Gly Asn Ala Glu Leu Ser Thr Thr Phe Gly Ala He Ala 
145 150 155 160 

Gly He Leu Arg Asp Arg Gly Pro Leu Gly Phe Phe Ser Ala Val Gly 
165 1"?C - ' 175 

Pro Thr He Leu Arg Gin Ser Ser Asn Ala Ala Val Lys Phe Thr Val 
180 185 190 

Tyr Asn Glu Leu lie Gly Leu Ala Arg Lys Tyr Ser Lys Asn Giv Glu 
195 200 205 

Asp Val His Pro Leu Ala Ser Thr Leu Val Gly Ser Val Thr Gly Val 
210 215 220 

Cys Cys Ala Trp Ser Thr Gin Pro Leu Asp Val He Lys Thr Arg Met 
225 230 235 240 

Gin Ser Leu Gin Ala Ara Gin Leu Tyr Gly Asn Thr Phe Asn Cys Val 
245 250 255 

Lys Thr Leu Leu Arg Asn Glu Gly He Gly Val Phe Trp Ser Gly Vai 
260 265 270 

Trp Phe Arg Thr Gly Arg Leu Ser Leu Thr Ser Ala He Met Phe Pro 
275 280 285 

Val Tyr Glu Lys Val Tyr Lys Phe Leu Thr Gin Pro Asn 
290 295 300 



<210> 14 
<211> 490 
<212> PRT 

<213> Aspergillus terreus 
<400> 14 

Met Thr Lys Gin Ser Ala Asp Ser Asn Ala Lys Ser Gly Val Thr Ala 
15 10 15 

Glu He Cys His Trp Ala Ser Asn Leu Ala Thr Asp Asp He Pro Pro 
20 25 30 

Asp Val Leu Glu Arg Ala Lys Tyr Leu lie Leu Asp Gly lie Ala Cys 
35 40 45 

Ala Trp Val Gly Ala Arg' Val Pro Trp Ser Glu Lys Tyr Val Gin Ala 
50 55 60 

Thr Met Ser Phe Glu Pro Pro Gly Ala Cys Arg Val He Glv Tyr Gly 
65 70 75 80 
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Gin Lys Leu Gly Pro Val Ala Ala Ala Met Thr Asn Ser Ala Phe lie 
c ~. 90 c = 

Gin Ala Thr Giu Leu Asc Asp Tvr His Ser Glu Ala Pro Leu His Ser 
100 105 110 

Ala Ser lie Val Leu Pro Ala Val Phe Ala Ala Ser Glu Vai Leu Ala 
115 120 125 

Glu Gin Gly Lys Thr lie Ser Gly lie Ala Vai lie Leu Ala Ala lie 
130 135 140 

Val Gly Phe Glu Ser Gly Pro Arg lie Gly Lys Ala lie Tyr Glv Ser 
145 150 155 " 160 

Asp Leu Leu Asn Asn Gly Trp His Cys Gly Ala Val Tyr Gly Ala Pro 
165 170 175 

Ala Gly Ala Leu Ala Thr Gly Lys Leu- Leu Gly Leu Thr Pro Asp Ser 
180 185 190 

Met Glu Asp Ala Leu Gly He Ala Cys Thr Gin Ala Cys Gly Leu Met 
195 200 205 

Ser Ala Gin Tyr Gly Gly Met Val Lys Arg Val Gin His Gly Phe Ala 
210 215 220 

Ala Arg Asn Gly Leu Leu Gly Gly Leu Leu Ala His Gly Gly Tyr Glu 
225 230 235 240 

Ala Met Lys Gly Val Leu Glu Arg Ser Tyr Gly Gly Phe Leu Lys Met 
245 250 255 

Phe Thr Lys Gly Asn Gly Arg Glu Pro Pro Tyr Lys Glu Glu Glu Val 
260 265 270 

Val Ala Gly Leu Gly Ser Phe Trp His Thr Phe Thr He Arg He Lvs 
275 280 285 

Leu Tyr Ala Cys Cys Gly Leu Val His Gly Pro Val Glu Ala He Glu 
290 295 300 

Asn Leu Gin Arg Arg Tyr Pro Glu Leu Leu Asn Arg Ala Asn Leu Ser 
305 310 315 320 

Asn He Arg His Val His Val Gin Leu Ser Thr Ala Ser Asn Ser His 
325 330 335 

Cys Gly Tro lie Pro Glu Glu Arg Pro He Ser Ser He Ala Gly Gin 
340 ^ 345 350 

Met Ser Val Ala Tyr He Leu Ala Val Gin Leu Val Asp Gin Gin Cys 
355 360 365 

Leu Leu Ala Gin Phe Ser Glu Phe Asp Asp Asn Leu Glu Arg Pro Glu 
370 375 380 

Val Trp Asp Leu Ala Arg Lys Val Thr Pro Ser His Ser Glu Glu Phe 
385 390 395 400 

Asp Gin Asp Gly Asn Cys Leu Ser Ala Gly Arg Val Arg He Glu Phe 
405 410 415 

Asn Asp Gly Ser Ser Val Thr Glu Thr Val Glu Lys Pro Leu Gly Val 
420 425 430 
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Lvs G.lu Pre Met Pre Asn Glu Ara lie 

4 3 5 4 4 0 

Ala Gly Ser Vai Tr.r Asc Glu Thr Ara 

450 * 455 

Val Leu Ser Leu Asd Arc Leu Thr Asp 

465 470 

Leu Asn Cys Pro Vai Lys Ser Pro Leu 
485 



PCT/US99/29583 

Leu His Lys Tyr Arg Thr Leu 
4 4 5 

Val Lys Glu He Giu Asp Leu 
4 60 

He Ser Pro Leu Leu Glu Leu 
475 480 

Val 
4 90 



<210> 15 
<211> 488 
<212> PRT 

<213> Aspergillus cerreus 
<400> 15 

Met Gly Arg Gly Asp Thr Glu Ser Pro Asn Pro Ala Thr Thr Ser Glu 
1 *5 10 15 

Gly Ser Gly Gin Asn Glu Pro Glu Lys Lys Gly Arg Asp He Pro Leu 
20 25 30 

Trp Arg Lys Cys Val lie Thr Phe Val Val Ser Trp Met Thr Leu Val 
35 40 45 

Val Thr Phe Ser Ser Thr Cys Leu Leu Pro Ala Ala Pro Glu He Ala 
50 55 60 

Asn Glu Phe Asp Met Thr Val Glu Thr He Asn He Ser Asn Ala Gly 
65 70 75 80 

Val Leu Val Ala Met Glv Tyr Ser Ser Leu He Trp Gly Pro Met Asn 
85 90 95 

Lvs Leu Val Gly Arg Ara Thr Ser Tyr Asn Leu Ala He Ser Met Leu 
100 105 110 

Cys Ala Cys Ser Ala Gly Thr Ala Ala Ala lie Asn Glu Lys Met Phe 
115 120 125 

He Ala Phe Arg Val Leu Ser Gly Leu Thr Gly Thr Ser Phe Met Val 
130 135 140 

Ser Gly Gin Thr Val. Leu Ala Asp He Phe Glu Pro Val Tyr Arg Gly 
145 150 155 160 

Thr Ala Val Gly Phe Phe Met Ala Gly Thr Leu Ser Gly Pro Ala He 
165 170 175 

Gly Pro Cys Val Gly Gly Val He Val Thr Phe Thr Ser Trp Arg Val 
180 185 190 

He Phe Trp Leu Gin Leu Gly Met Ser Gly Leu Gly Leu Val Leu Ser 
195 200 205 

Leu Leu Phe Phe Pro Lys He Glu Gly Thr Ser Glu Lys Val Ser Thr 
210 215 220 

Ala Phe Lys Pro Thr Thr Leu Val Ser He He Ser Lys Phe Ser Pro 
225 230 235 240 

Thr Asp Vai Leu Lys Gin Trp Val Tvr Pro Asn Vai Phe Leu Ala Val 
245 250 ■ 255 
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Ser Ala Trp Glu He Cys Fro Leu His Leu Leu Glu Thr Lys Cys Ser 
260 265 270 

Cys Arg Lvs Gin Lys Asp Leu Cys Cys Gly Leu Leu Ala He Thr Gin 
275 280 285 

Tyr Ser He Leu Thr Ser Ala Arg Ala He Phe Asn Ser Arg Phe His 
290 295 300 

Leu Thr Thr Ala Leu Val Ser Gly Leu Phe ~vr Leu Ala Pro Gly Ala 
305 310 315 320 

Gly Phe Leu lie Gly Ser Leu Val Gly Giy Lys Leu Ser Asp Arg Thr 
325 * 330 335 

Val Arg Arg Tyr He Val Lys Arg Gly Phe Arg Leu Pro Gin Asp Arg 
340 345 350 

Leu His Ser Gly Leu lie Thr Leu Phe. Ala Val Leu Pro Ala Gly Thr 
355 360 365 

Leu He Tyr Gly Trp Thr Leu Gin Glu Asp Lys Gly Gly Met Val Val 
370 375 380 

Pro lie lie Ala Ala Phe Phe Ala Gly Trp Gly Leu Met Gly Ser Phe 
385 390 395 400 

Asn Cys Leu Asn Thr Tyr Val Ala Val Glu Ala Leu Pro Arg Asn Arg 
405 410 415 

Ser Ala Val lie Ala Gly Lys Tyr Met lie Gin Tyr Ser Phe Ser Ala 
420 425 430 

Gly Ser Ser Ala Leu Val Val Pro Val lie Asp Ala Leu Gly Val Gly 
435 440 445 

Trp Thr Phe Thr Leu Cys Val Val Ala Ser Thr lie Ala Gly Leu lie 
450 455 460 

Thr Ala Ala lie Ala Arg Trp Gly lie Asn Met Gin Arg Trp Ala Glu 
465 470 475 480 

Arg Ala Phe Asn Leu Pro Thr Gin 
485 



<210> 16 
<211> 516 
<212> PRT 

<213> Aspergillus terreus 
<400> 16 

Met Thr Leu Gin lie lie Val lie Ala Ala Thr Ala Val lie Tyr Phe 
15 10 15 

Leu Thr Arg Tyr Phe Asn Arg Thr Asp lie Pro Lys lie Lys Gly lie 
20 25 30 

Pro Glu lie Pro Gly Val Pro lie Phe Gly Asn Leu lie Gin Leu Gly 
35 40 45 

Val Lys His Ala Thr Val Ala Arg Lys Trp Ser Lys Glu Phe Gly Pro 
50 55 60 

Val Phe Gin Ala Arg Leu Gly Asn Arg Arg Val lie Phe Ala Asn Thr 
65 70 75 80 



31 



3NSDOCID: <WO 0037629A2 I > 



WO 00/37629 PCT/US99/29583 

Phe Glu Ser Thr Arg Gin Leu Trp lie Lvs Glu Gin Ser Ser Mez lie 
85 90 95 

Ser Arg Pro Thr Phe His Thr Phe His Gly Val Val Ser Ser Ser Gin 
100 105 110 

Gly Phe Thr lie Gly Thr Ser Pro Trp Asp Glu Ser Cys Lys Arg Ara 
115 120 125 

Arg Lys Ala Ala Ala Thr Ala Leu Asn Arg Val Ala Vai Gin Ser Tyr 
130 135 140 

Met Pro lie lie Asp Leu Glu Ser Met Ala Ser lie Lys Glu Leu Leu 
145 150 155 160 

Lys Asp Ser Gin Gly Gly Lys lie Asp He Asn Pro Thr Pro Tyr Phe 
165 170 175 

Gin Arg Phe Ala Leu Asn Thr Ser Leu - Thr Leu Asn Tyr Gly Tyr Arg' 
180 185 190 

He Glu Gly Asn Val Asn Asp Gin Leu Leu Arg Glu He Cys Glu Val 
195 200 205 

Gin Arg Gly Val Ala Asn Leu Arg Ser Thr Ser Asn Asn Trp Gin Asp 
210 215 220 

Tyr Val Pro Leu Leu Arg Leu Phe Ser Asn Arg Ser Asn Gin Ala Lys 
225 230 235 240 

His Leu Arg Ala Arg Arg Asp Lys Tyr Met Ala Phe Leu *Phe Asp He 
245 250 255 

Leu Lys Asp Arg Met Ala Lys Gly Thr Asp Lys Pro Cys He Thr Gly 
260 265 270 

Asn He Leu Lys Asn Pro Glu Thr Lys Leu Thr Asd Ala Glu He Lys 
275 280 ' 285 

Ser He Cys Leu Thr Met Val Ser Ala Gly Leu Asp Thr Val Pro Gly 
290 295 300 

Asn Leu lie Met Gly lie Ala Tyr Leu Ser Ser Glu Asp Gly Gin Arg 
305 310 315 320 

He Gin Gin Lys Ala Tyr Glu Glu lie Met Ser Vai Tyr Pro Asn Gly 
325 330 335 

Asp Ala Trp Glu Arg Cys Leu Val Glu Glu Lys Val Pro Tyr lie Thr 
340 345 350 

Ala Leu Val Lys Glu Thr Leu Arg Phe Trp Thr Val Met Pro lie Cys 
355 360 365 

He Pro Arg Val Asn lie Lys Glu Val lie Tyr Asn Gly Ala Arg lie 
370 375 380 

Pro Ala Gly Thr Thr Phe Phe Met Asn Ala Trp Ala Ala Asn Tyr Asp 
385 390 395 400 

Glu Asp His Phe Asp Met Pro Asn Arg Phe Leu Pro Glu Arg Tyr Leu 
405 410 415 

Glu Pro Ser Glu Gly Phe Gly Thr Pro His Tyr Ser Phe Gly Ala Gly 
420 425 430 
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Thr Arg Met Cys Ala Ala Ser His Leu Ala Ser Arg Glu Leu Tyr Thr 
4 35 440 445 

Val Phe Leu Arg Fhe lie Val Ala Phe Thr He Glu Pro Ala Gin Asn 
450 455 460 

Pro Ala Asp Met Pro Val Leu Asp Ala He Glu Cys Asn Ala Thr Pro 
465 470 475 480 

Thr Ser Met Thr Thr Glu Pro Lys Pro Phe Lys Val Gly Phe Lys Pro 
485 490 495 

Arg Asp Glu Thr Ser Leu Arg Arg Trp He Ala Glu Ser Glu Glu Arg 
500 505 510 

Thr Lys Glu Leu 
515 

<210> 17 
<211> 481 
<212> PRT 

<213> Aspergillus terreus 
<400> 17 

Met Lys Pro Ala He Leu Met Lys Tyr Trp Leu Phe Val Ser Ala Val 
15 10 15 

Ser Ala Ser Thr Leu Asn Gly Lys Leu Thr Leu Ser Glu Thr Lys Val 
20 25 30 

Thr Gly Ala Val Gin Leu Ala Cys Thr Asn Ser Pro Pro Asp lie Tyr 
35 40 45 

He. Asp Pro Asp Asp Ser Val Ser Val Val Arg Ala Ala His Asp Leu 
50 55 60 

Ala Leu Asp Phe Gly Arg Val Phe Gly Lys Asn Ala Thr Val Arg Phe 
65 70 75 80 

Thr Asn Glu Thr His Pro Thr Ser Met Ala lie lie Ala Gly Thr lie 
85 90 95 

Asp Lys Ser Thr Phe Leu Gin Arg Leu lie Ala Asp His Lys Leu Asp 
100 105 110 

Val Thr Ser lie Arg Gly Gin Trp Glu Ser Tyr Ser Ser Ala Leu Val 
115 120 125 

Leu Gly Pro Ala Lys Gly lie Gin Asn Ala Leu Val lie Ala Gly Ser 
130 135 140 

Asp Arg Arg Gly Ala He Tyr Gly Leu Tyr Asp He Ser Glu Gin lie 
145 150 155 160 

Gly Val Ser Pro Leu Phe Trp Trp Thr Asp Val Thr Pro Thr Lys Leu 
165 170 175 

Asp Ala lie Tyr Ala Leu Asp Val Gin Lys Val Gin Gly Pro Pro Ser 
180 185 190 

Val Lys Tyr Arg Gly lie Phe lie Asn Asp Glu Ala Pro Ala Leu His 
195 200 205 

Asn Trp He Leu Ala Asn Tyr Gly Glu Val Glu Asn Gly Asp Pro Ala 
210 215 220 
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Phe lie Ser Arc Phe Tyr Ala His Val Phe Glu Leu lie Leu Arc Leu 
22 5 23 C 23 5 24 0 

Lys Gly Asn Tyr Leu Trp Pro Ala Met Tro Ser Asn Me" Phe Tyr Val 
245 250 255 

Asp Asp Thr A.sn Asn Gly Pro Leu Ala Asp Tyr Tyr Gly Val Val Met 
260 265 270 

Gly Thr Ser His Thr Gly Met Thr Val Gly Thr Pro Cys Leu Lys Ala 
275 280 285 

His Ala Aso Tvr Glu Lys Glu Pro Met Ala Arg Ala Thr Asn Glu Gin 
290 295 300 

Ser Gin Phe Leu Asn Gly Thr Trp Asp Tro lie Ser Asn Glu Val Asn 
305 310 315 320 

Val Lys Ala Phe Met Arg Glu Gly Val lie Ara Ser Gin His Trp Glu 
325 330 . " 335 

Thr Ala Tyr Thr Met Gly Met Arg Gly Leu Gly Asp Ala Ala Ser Pro 
340 345 350 

Thr Leu Asn Ala Thr Val Glu Glu Ser He Val Ser Trp Gin Glu Ser 
355 360 365 

Val Leu Ser Asp He Leu Asn Lys Thr Asn Leu Ser Asn Val Val Gin 
370 375 380 

Pro Phe Val Leu Phe Asp Glu Leu Gly Thr Tyr Tyr Glu Ser Gly Met 
385 390 395 400 

Thr Val Pro Asp Gin Val Thr Leu He Tyr Pro Asp Asp Asn Ala Gly 
405 410 415 

Asn Met Leu Arg Leu Pro Leu Gin Asn Glu Thr Gly Arg Ser Gly Gly 
420 425 430 

Ala Gly He Tyr Tyr His Phe Asp Met Asn Ala Pro Pro Arg Cys Tyr 
435 440 445 

Lys Trp He Asn Thr Ala Gin Leu lie Arg Thr Trp Asp Gin Leu Arg 
450 455 460 

Ala Ala Tyr Ser His Gly Ala Gin Thr Val Trp Val Ala Asn He Gly 
465 470 475 480 

Asp 



<210> 18 
<211> 33000 
<212> DNA 

<213> Aspergillus terreus 
<400> 18 

tggattttct tcctgtaggc ccgtagctat 
aatattagaa actgctcgcg tatcttatcc 
gttttagcaa gctggacggc ctgcagggcg 
gaaagatagc ttatatagct ataaggcttt 
aaaataataa gaaatctaat aagagtactt 
agtaagataa ttaaaaatat taaacaggcc 
tgattaacgc gctacaatag tttasgatct 
cggctataaa tttttatcta attaagcgcg 
agtaaaatta gctggttaqt ccggctacta 



gtaatctagc taaacagagc gcgtatttta 60 
agagcgttag ctaggtaggt tacctggtct 120 
actaatattt aggctatttt tataagcccg 180 
agaaagatct actgcttaat atctatttct 240 
ttaaagagat ctttcttaag agtatggtcg 300 
taattaagca gttctttagt ttgctgctgc 360 
tagctttaga ttaggagatt aactagctgc 420 
gtaaactagg cagtatttag ctagtggcgg 480 
tggtaggcga agtaaataag acactgctag 540 
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azczaczaqz actaacacza cc zccczacc cczacazaqa zczaqaz zaq ctgcttttag 600 
azcccccqcz ccccoacgaa CclcUccIc icccitcaci: coaatccstL cccccagccc 66C 
;;c:iaa::: cctaatacct czaaz za~ z z agzzcczcqa cqczaqcaqz aaaucacttt 72C 
actaaaca:: cagc:aiaco eciaaaaacc c:accgagia gggcgactgc gg t t caacct 780 
zaszz azaaa qq z a z z cc qq cz a l z aaaz a aqczczagcz az aaqqa aa z tctagitaga £4 0 
t c t a a a t z a a zaazaqaqaz. czaqzcqzza q zcczazccq cociacccia cttititttict 9CC 
acagcctcc; ccaccccacc: czcaazaazz azaqczaaqc ccc tiageca qacqqczzzc 96C 
tctagtcttt aatuacaaca accc t ccc z a : a r. a : : 5 c : : caaacaa: la ac;a :-* :acai 102C 
tacciga eaa agaaccccca aiagctaaca aza z aczczq zz aqzcqqcc gcgaacccca 1080 
qczzazazzz aagcaatact czzczaaczc tat:::c:[c gtoccctatt aeaatcagta 1140 
zaqzzzzaaz tttaatattt acuatcctc cgtccgcact aatagata;a cttaiaatat 1200 
acgcagc'at aactacgqta gactggaaca cccaaaarca cagagctac: tagagggggg 3.260 
aaaraaataa tctcccactt tagattaatt tagagctgcc gaagtattac agttaaagca 1320 
gcttgttaga gggccggaac agtggcatta gaacaaaagc tataacgcgt ttggaggtag 1380 
a zaa zaaaaq tagtagaaga aactatatac tagcaggaag gtgtagtata gatctagatc 1440 
ttataggtta agtcatagag aaacagctct attgttaatt ctaggctcta agagaaaagt 1500 
acctagagag t taaaaa zaa aggaaaatag gtttcraraa ggagctattt tctttttaat 1560 
atttatta:a ttttttaaag atatattaat ggcgcctcgg a tacacg tag taaaaagtaa 1620 
at tcgtgt ct. cctactgctt attcctgaag ataaaaia la gatataatcc cgctaggtcc 1680 
tctttaaata agacaqaacg gcggagcugt ccgttcgggg cacgcatgna aggccacggg 1~4Z 
agcacggcga aatattaaat cicgaccaat tagcaggcga gaaaatggat cgaaggtcgg 1800 
gtgaacttgg gcctagggac taggcaacca ccaagagaca tcctggctac tatagtccct 1660 
attggcaatg gcctgattcg ctcggtccaa gctctgcgcg atcaaactcg acgagttcgc 1920 
ct tgacgccc accgcaacgt cacacatgaa a acccca tag ggatccgggt cagatcccag 1980 
cgcatcctgg cacgtatccg catcgggcag ccggatcgat ccggagttca taagcgcggt 2040 
aatttcgtcr gtctcggtcg cgttcgtgct ctcgcctgtc gtgtctccaa caatcgcgat 2100 
atcgggccat gcggatagga agaagtaagt gtttgggttg tcggggtccg gttcttgggt 2160 
tccgggctcg caccagcggt ggccgtcaaa tcgcgcgtcc atatcgatgt aatggatctg 2220 
ctccgtatte cgggcggtgt tgatgtcgga aatgctggat tggattaacg agttgagcat 2280 
gcggacgacg tcgttgagci cc 1 1 gcg tag g t cgc t cgt g agccacacac agctgccgga 2340 
ttgctgcggc tcgtagtgtg ggctttcgta ccagaaggtg gttgagtcgc agtcggtggt 2400 
gtcttcgttg aagaaggtga catagccagg gacgtagagg ttgaacgact aaagtaaagt 2460 
cagcgcgacg aaggacagca cgacaggaat aaataaggat aacagggaga tatacgctag 2520 
agccagatc: atccaggatc ttttcatacg cagccctaag tttcgagccg agtccgtcag 258C 
accctgtatc ctgcatcagg ttgcgggctt tctcttcagt ctccagacac cactggcggt 2640 
attgctcaag cgagtaccac cacattgtta gcacccagtt gqaaaccaga tcgctgaaga 2700 
acacatcgtr ccctccaatt gtcagggttg ccatggtagt ccccgtgggg tcctgtccta 2760 
gccactggtc gatcttttta ttcagcccaa ccgttgtatc tccagagcac gcatagttgg 2820 
tataagtgaa atcctcagta tcaaaccact cctggacgag ctttccgtag ctgttacttc 2880 
ccacgcggca actgtcaccg gttgtggttc ctgtacccat cccagcggcg taggagtcac 2940 
cgaagtgaac ataatcgttg acgccgtaca cagtgtcagc atcataagac gaatcatcgt 3000 
aggccggaca gctggagtca tccgccgaag aagcagactc ctcaggaagg ggaatctgtc 3060 
gaccacatat cttggtatag tacccgccaa gtacgacaag agcccatgaa tccgcattgt 3120 
aggcggacaa cttgctgtca cagatgcctc cttgcccgag gttatttggg tctgggcaaa 3180 
gcaatctgcg ttgccagtta gcatcgccgg ctatacggag ctcattagcg gctctttcag 3240 
cctttctgcc tcttccgata cagtgtcgat tgaaggrccc tgcggcaaga tctcgagctc 3300 
ctttccatcc atagacataa tcgcgggtcc tcctaagqtg aaagaatcag tatcggcctg 3360 
ggaatgagaa aaaatatgtg ataccaccca ttctctccat tgacaatctc cataacatag 3420 
tccgtatgcg ttacttcgtg caaaatcgcc ccggctcgag tgagatgggc atcatgaaga 3480 
ttaattgagt cgcacctcgc gagcactgac ctggaactct tcaactcatc tgtgaagaac 3540 
tttgggcaga aattaagccg attcctattg ttcatcatgg ccaaccagcc gttctcgtca 3600 
cagccccgaa ggtttttgca agtgatcgtg aacttttcgt cgtcgaattg cgaagaccca 3660 
gaaatcatgg tgaccatccg gtcgaacgta taccttacat ttatagtgaa ggtcaagctg 3720 
tcggtgagtg attcggcaaa gaagatgtcc ctacatgggt cgtaigcgcc gccgctaaag 3780 
cacgcaactt ccgaggcagc atcaagcgca tattgcattt cttcaaagag cgtctgcacc 3840 
cgggccctag tgcactcgct gttgtcctga taatgagggg tgagctcgat gggggctttt 3900 
acaggagata caggtagatt ggggttagtt atcgggccgt catcgcctgt gtggtctttg 3960 
gtggcacatr cgttgtacca cctctgcgct gcgttcttta tggagttcca gatgctgtgc 4020 
atgtctgcac gaecgtcgcc gttcagatca gggaaanaag tgcatgagcc ttcaaccgcc 4060 
cctcggtact tgggaccctg cggtccccat tcgtaccgcg atcccttgat atcacgttgg 4140 
ccaaggttgc accacaccga cccatctccc gagaacctgt ctgtccagac catatcggcc 4200 
tttccgtcgc cgttgacgtc ggcccagtgt agattcgccc tgtccttctc ctcggagtac 4260 
ttgaattgat caatgtagrc ccatccatcg tccccattga cccaqcccca ggtgcggccg 4320 
tccttctcaa cgcacagata atcggccttg ccgt tgccag aaacatcacc aaaatggacc 438C 
ggccggtcaa agaaaccaag gcctcggtgc tcggcgcagt aaagctcatc tgcagcattg 4440 
atattgtagt cccagtcaaa actccccctg t ct t tea t ct tgttccgcca aacttgggcc 4500 
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ctcttccca: tctc.cccatc cciccaca:; acaicccacc ciccctcocc ctcccagtct 4 560 
cccaaacoca ga lCgcgccc czcaaoczz z accccaarc: cctczzccoc ccccaaaata 4620 
azczcaztcc ccc zeeacaa czz ccac::: ccccccocca c::caaccac cc zeezazzc 4680 
caataaacac ccacataocx gc::caa:cc azecaaazgz aotccatca: accaicctcc 474 0 
ccccccatca :a::ccaata cctcictccc zeeqezazaa caazzaccaa z zaazczczc 4800 
tatgagaact tec c tec a via cc::;ca:c*. cacccccicc :ccaccct:c c:c::ccaca 4 8 6 0 
catgaaegta atacagcgcc ccaaaacac: tgtcaqaggt acccticuc ancaacacat 4920 
agtcctgtct ccccaactct ccaaaatccc ccaccicccc aiaaactccc ccaaaotaca 4960 
cccgatcxcG taatccgaa: ctcccgaata uactccca: zccqqgazac gacccgccac 504 C 
agz zaqca z c ttggtagaac ccclQgcccc acacgacc:: caacccgtca ccacaczczc 5100 
etttgatgea getcegggag ttggtgtatg ttgtcgtagc gccatcgtca tcaacccaca 5160 
tccagtcgtc gcgcccgtcc ccgttgaca: ccccgaatcg cacgccgcgc aggtctccca 5220 
tgactttccc cgtgaagcgc ttgcccagcc gctgccagca tgeggggata tcttcgatcc 5280 
atccatttcg ccagcatgtg acgtcgccc: igtcagccaa acegcagtag tcgccgcggc 5340 
cgtcgccgtc gatatcaqca aggegtaca: gtgectqate gtatccttcg cgactcttcc 5400 
.atagtccgat atcggtgaag aatggaggct tgttcctctt ggcgtcgccg tctccattgt 5460 
tgatgcctgc gtaggtggtc ccgtccaacc egatgeagat gtagtcatca agtccgtcgc 5520 
ctatcattta ccacacaiaa atcagtccca gatggttcct eggcagacag acacagtget 5580 
tacegttgat gtcgataaaa tgcacacccc ccgggttaca attattatga gtcgagaacg 5640 
tccccccctt cgtaaactcc tgctcctcac tgtggacatt attcgtgtaa gaaacaaacc 5700 
tcaccgcgcc ggagtcctta tctttgtgga agatcatcat gtcatcatac gecctagtat 5760 
aaagcctccc gaagaaaaag tgcagctcgt cgtcatcctt gtatggatcg gctgcaccct 5820 
tcccggcgcg gacagtaaac aacgccccgc tatactcget gtcgtgtcga tagattccgt 5880 
catcctcacc actcccttgc tgcgtgaacc cgccagcgta gactccgctc ccgiactctt 5940 
tgtcacaggt ccccgtggat gagatgtcaa ggtccgccgg cttgacaatg agctccgcca 6000 
ccgcagcgtt gtagatcgcg ttataccaga tgtctgecat ctgggagtac ccgtagtcgt 6060 
tggggtgctt gttatcggcg aagttatccg ggtacgtgat ccagttgttt ccggggctgg 6120 
gagceggegg atccatatcg gecaggaega eggagacatt ctgcgcctcg cgcaigtcaa 6180 
ggaccagctc gcggaactgc gcgttgacgc agggcctgtt agcttcgagg gttctggaac 6240 
ccgaggggat cagggtcgac aggacgatga gcgtgttggc catgtccggg gccccgatta 6300 
gggtttcgat cagggagege atgcgctccc cggcgttcgc agggtcgacg ttgtagtcgc 6360 
agtcgttggt gccggcgttg atcagcacga cgttcggctt gtaggegage gagtttgegg 6420 
ccgcggtttg cacttgegtt atcacgtcgc cgctgtgggc ttctacatcc ttt:cccgcg 6480 
tttatatgtc aattggtctg tctatctgtc ggaggggtgg gggaagagag atgeatacat 6540 
tgtctaccat gtcaccgttg gaettactge ccaccatgtc cacctcccag ccttcaaagc 6600 
gaagtttgtc aeggagaggt ttgegatate catttceggt tgaggatagg tatccccagg 6660 
tgatggatgc gecaagaggg aggatccgea gagggacttt cagggcatca tctcgttttg 6720 
ctcttggacg ccattcagca gcgtcaaagt ctatatatct egaategtet gtgggtcgcg 6780 
agacagggac ggctgctcct gtggcagtag atgatagaag agccagtgtt agattgaaca 6840 
ccgtaaagaa aagtagagac gecattateg atgetgaetc tctgcacctt tcactcctct 6900 
tccatactgc gagtegcttt tatactttca cagccctcca gtcatttcat gtgtaagatg 6960 
cctcggcgta tgtgccgttc tgaaacaagr gtacttccaa gaatcgegag ctcgagtatg 7020 
gtaccaggat aaacctggat acrtaggtat caaagcatga gaccctggca tttttcatgg 7080 
caagtttggg ggecataate ctgtggcaga ggttcatatg eggeggtega gtcaigcaat 7140 
gggtttattt ggcgtagttt aeggagcaac caatcactgg cccgaatatg gtaccaggat 7200 
gaaccagaat agtaccaaag gatgagactg gcatccttca tagecataac tgttgggggt 7260 
catcatctgt gacagggaga tatgcagtga ttgagtggtg caaegggett aaatgtaatc 7320 
ggtgtttgca actacgegga gtgctaggga ggeggctgat tggcatgata agcaaaggct 7380 
tagctgagac atggcactag gtagaggcta ggacctgcgt aagcattcat tccgatgctc 7440 
tattggataa gttatttaca atctccgcat taggeggcaa tccttaatat agaatactag 7500 
tatagagcac tatggacact ccgacgttca tttaatatct ccacccgtgt tacccctctt 7560 
tetgectttg atctctatca agctggccct tctggcattt atecttgeat taaacatgac 7620 
tctaccaaca cttcctaact ggataaggat gtgcgtgcat ttgtccctta cacatctcca 7680 
tcagcaccgt tccccgaaat acgagtctat acctattaaa agtatccagg ctaattcaca 7740 
cagaatcctc atcatcctaa ccacagcctc cttctacccg cagatceggt gcatccaact 7800 
tcgaaactcc acgcacggca tctc-caccgc ctacatcctc ttcaacctaa tcagcgcaac 7860 
agaacacttc accatcctat tegcattget ggtaaacagc ggcggagatg tccccatcca 7920 
tgagcccccc acgaccggcg acgggttgaa cctgtaccag ettttegcag tgtggatggg 7980 
atgcttagtc ctcttctgcc aagcaatcca tagcctccac gccaatccac gccgcaaact 8040 
catcctacta accatataca ttcaatacct atgeatttet atcttaccag aggrcatcga 8100 
cgcaatcacc actcccgagg aaacgagaaa acaaaggecg ccaacgggcg agaggaactg 8160 
getgategga ctctttcttt ccgcgcacgc gatgacegtc ctgccactat cggccgtgct 8220 
ccgcatcgcg ggattcatag ateagtcgeg actgatctcg cggcgcagac ggcagcagcc 8280 
ateggtctta agectgacag gcctggcgtc teaggcegtg gtctttgetc tactttctgg 8340 
actctggqta ctcagggttc ageagectgt tcctcgaatg ccgatgagaa gacctgtgga 8400 
ttggatgtat tggtaccatg taattggqtg gccggttgtc gaegatgegg tttatgeget 8460 
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coaacaaicc ctt ccctt :t cc'iatoccct 
iCtaegccC'c ce:cc:cccc acac: oa z oa 
coccacaacc ccoactrc*. i coatrcictc 
caeca lQG'lC zooaagczaz acz ca z cc cc 
ctcttggtta actctt: :cc c::i::::a: 
ctgeaggaga aagagaacac ::a:accc:c 
tcaaactacr aataacaccc z qa a z goo a z 
ctggctgcaa gccctaaaca ccrgrciac: 
tctcgcaagc ctaccactac cttatagtca 
agtatttacg qctcatacaa aaagtaatat 
catctatcgc ctagattggt agetegcagt 
cccgtggtcg caattcaacc taatgcagga 
t a 1 1 gat ct t taattgtggc *,:catccgat 
cattcgaatt acaatgttga acctctgcca 
tctctccacc agcgttgaag zazaa tcccg 
a t accaact c taattata a z cagaaaaaaa 
ctgccaatcc tegcagacta tatataagaa 
agattttagg acccgctccg cgeggagegg 
tacaaaatac tagaggegge gaggtcgagt 
gttattttag cgegaaaaca gggtactaat 
tagtaccaca caactacatt acgattatat 
agcaccttac acgtggccgc cctcaactac 
gacctaactg agtgtagttt agagatatat 
attaegcaga ct tt t t acta aaagcataat 
aaggtagatt agggtaggta cgactgcggc 
cttctggcgt agactactgc tatagtgaac 
gcatcatgag cttcgtggtg egatgaatge 
acggttcgtc ggggaccacc ttccagtcgt 
tgatctcctt agaggcaaag aaccggccgg 
tgtggtcccc gttggtgttc tccagttggg 
ggegcatata ccggtaagga tegtattttg 
tgeggtegge agctaccgcc accagctctc 
tcacatcctg caatgeatag ctgcgcatgg 
actccttgag acaactatcc agcagtttga 
ettggecgat gaccgtccgg atctcatcac 
cgatgtccac caacccaccg atcagcaggt 
tggccagttg cgccccggct gcatcgtacc 
ggatcgagtc taegtagega ggcggctcaa 
cacgtcgctc ctggataatg ggatcgagaa 
gtttggctcc ctggggctcg agccaatgta 
gcagctggcg cgcttgaatt gccatcgtca 
cctcattatg ggctaacttg ctcccgacca 
ccagacactg gttatagaca ggcactgtgt 
tcatcagcta atggcctaaa aacaaaaata 
caaaaatatc cgtgataatc ccgctggctt 
cttgggtctg gaactggttc ataaccacct 
ccttgaatcc atcgaatccg ggaagatgag 
agtaagtcag gtcctcacag eggaagcagt 
caagaacttg tacatacaga gctctttcat 
aagaatggtg cagtacccag agtcgactat 
ccagctctca atccatgacg gtgeattege 
cagttcccac catgttttcg gattgaacac 
caggtaaaga agaaggcata gtagccccgc 
gccatgttgc tgcgtgtcat tccaagccag 
gtcgacagtc atggctagga gaccaggtgt 
gagcaaaaga teegggaaag gtcgegaagg 
ttccctatca tgcaatcacc gcttgctgta 
tccccaccga ataaegcegg acagctgttg 
ccctaccttc acttactcaa tcggcgcttc 
aegggectta atagegaccg gccgggccgg 
acccgaaatc tttggcccgg catacatgag 
ctgcctcagc ttcaatccaa gctcacgagt 
etactgeaaa cagcatggct caatctatgt 
gtggttgtcg cttccctggt gacgccaaca 
atcctcgcga tgtgcagagt cgaatcccca 
acccggacgg gaagcaccac gggcgaacaa 



zzzzzzqeeez ictcccgccc atgctaggga 8520 
cc:ct;acca gagca:gaac gecatggcta 8 58 0 
:a:ai:ca:.:c tcccaccata icgcgtgcaa 86^0 
azoaazzzzz zqacaaqcz z qczazaazaq 8 700 
zczqccaz ac gctctcattc agaccacgat 8 7 60 
ccatgattitc aacc~ z cc a a aczczzazza 8 8^0 
ggcttacget z z aqazcaac qaczzagaqc 8880 
ecaegt tata autaccaac zcz cccgagc 8940 
atagattgtc qtatatccaq gggectttta 9000 
gagacagctc tactcttttt acatcatcta 906C 
gggtcatccc tatctgttac cataaaactt 9120 
etttctegtg ggecgattta ttactagacc 9180 
cgctactacc tacagccaca gtgatcatag 9240 
cgattactat caggtccttc tagttggacc 9300 
tagactcaaa aaccaacccg acctgatcca 9360 
tattttaaaa attaaaattt atatagatat 9420 
acccgtaatc tgegeggact atagtagtat 9480 
ageggactgt ggatctgegg agagttagta 9540 
tagegcagtc tagtactaaa taggaaagcg 9600 
atttttggac ccccactgta ccttcacgga 9660 
atcatctttc cccctcccat ggttttccca 9720 
ctagtttgtt gacagaagtg cagtaacagt 9780 
attctactat agggggctaa accgtggtgg 9840 
agctacgata ggtagaatat aactatagat 9900 
ggcttgatta ttccttgaaa atgaeggget 9960 
caggaaggcg gatatcctcg tcgcgccggc 10020 
geaegctgaa agaatggegg tagtactgea 10080 
ategtatgag caagtaggct aacatcatct 10140 
ggcaagcccg tggatgccaa ccgaagccga 10200 
cactgaacgc tttagccggg tcctctcgca 10260 
ccggctctgg ccagacctcg gggttgctca 10320 
cttttgggat aaaggttcca ttggagaaag 10380 
tggcacattc gaegggcttg acgcgctgtg 10440 
gcttgtacag cgaggcaggt gtccaacccc 10500 
ggaggggctc aaggagatgg ggatgtcgga 10560 
eggaggttec gtagatacca gcaaagtcca 10620 
atttcccctt ggcagtatcc tcgaaccact 10680 
tgccctttgc ccggcaggca tctctttccg 10740 
gttgeeggge tcgtcgaacc tgcgcccgga 10800 
caagggggcg cagaatgacg ggecagagge 10860 
ccgcgtggtg cttggcgatg tcaagccact 10920 
taataaaagt cactgttcgg gtcactaagt 10980 
gecattctga aagcaaaagc acatccctct 11040 
ctcatcatga tcacttacca ttgctgtcgc 11100 
cattggcaag aggcttgacg tacttgggag 11160 
tggtgatgag atgtgcatcc ctegtgaett 11220 
agtgaaagtc ctgatgagag cacgaagatc 11280 
tgcaaagaac ggtggactcc ttaccgtgcc 11340 
ettgegaaac tcatcggcca tagaggaggg 11400 
gaaccgaatg ggcttatcat tttgcgagaa 11460 
atcaaaatcc cqtttggccc tcatggtcgt 11520 
eggcagatea gatctccggc cactcgagca 11580 
actggtagtg accaagggcg caaaccacga 11640 
cgacagaagg tggtgcggct gtgtgagcgc 11700 
ggttgaggga taagatatcg agagtgatgt 11760 
aaagggcgtc tctcttacca agaaagtctg 11820 
cggtggtgat gatgctggga tggtggtggg 11880 
aagccgaatg acgccggcag gecaaaagaa 11940 
ccctcctatc accaaategg atgtaaatgg 12000 
gaatccccaa aegtagatag ataggcatag 12060 
cacaggaaqt ttcacgcgac ggcgcctttc 12120 
tctgtcgcct ctatcagtcc cqcaa-.rgtc 12180 
atcctaatga gcctattgtc gtggtcggca 12240 
caccctccaa gctctgggag ctactccagc 12300 
aagaacgatt tgaegtcgae acattttatc 12360 
atgcacccta cgcctatgtt ctccaagacg 12420 



37 



3NSDOCID: <WO 0037629 A2 I > 



WO 00/37629 



PCT/US99/29583 



azcz gcccc: c::ca£::cc zzczzczzca azaz ccacoc icnacaaccc Oaoectctgc 12 4 80 
ccccccicj;- crcccic~~c rirccacacgc ^ciaccaccc cccaacccai cc:ccaaicc 12 54C 
czaLccacca :;:ccac:aa aczzccaczc czczzzaccz ccgcqtcaic acccacgac: 12600 
acgacac:g: czcaacccc: gacc:ggaca cca iccccac czaczcggcc acccc: cccc 12660 
ccctcactci zccgzccaac ccca :ctccc ai::t:t:aa ctcccatcca ccaaczcua 1272G 
gtcacccaa: aicctctacc aczczaazca igctccaacc gaccggaaic c: icaaacaz 12780 
gacgcicgcL acgccancca gctcctcctt ggt rgccg t z caret gcccc :ccaacaacz 1284C 
accaacgggc caaagctcca tegcaac tgc tcccggigcc aacctga::c zcccccccaz 12900 
gacaetcctc c::gaaacce aaitgaccet cctazccccc tcggcccgat cccgcacccc 12960 
ggacgccgga cczoaccccz atgecagagg cgtgagtgrt tcccgagctc gcagatgaca 13020 
gttcccatcg ctcaccgcga tcaggaagct gccngccctc tagtgttgaa gacattgagt 13080 
caagccttgc gcgangggca cacgattgaa zgzgtcazcc gagaaactcc cgtgaatcaa 13140 
gatggccgaa ccaccggaat tacgatgecg aacca tag tg ctcaggaggc actcatcaag 13200 
gctacctacc cccaggc:gc ccttcacazc accaaggccg aggacaggtg ccaattcttc 13260 
gaggctcaic gtcagcaaag agaacctgtt ctgttggcgc cctgcagctg acattegtat 13320 
gacagggac: goLactccgg ccggagatcc ecaggaggeg gaggecatte caacagcctt 13380 
ctccggccac gagcaggtag cacgcagcga eggaaacgag agggcccctc zqz tcgtggg 13440 
cagtgcgaaa actgttcccg ggcacaccga gggcacggcc ggtctggctg gtcccacaaa 1350C 
ggcgtcgttc gctgtccgcc atggggtaat cccccccaac ctgctgt teg. acaaaatcag 13560 
cccgcgagtc gccccattct ataaaaacct gaggattccg acagaagct'a cccaatggcc 13620 
agctctccca cccggacaac cgcgccgcgc cagtgtcaac tcctttggta agegagqatt 13680 
geceggagga accctcacaa gtactcgaat taatgetaac tgaaccgcgc cgatggacag 13740 
gatteggegg cacgaatgeg catgecatta ttgaggaata catggageca gagcaaaacc 13800 
agetgegagt ctcgaataan gaggactgee cacccatgac cggtgtcctg agtctacccr 13860 
tagtcctctc ggegaagtec cagcgctcct taaagataat gatggaggag atgctgcaat 13920 
cccttcagrc zcaccccgag atacacttgc acgacctcac ctggtcctta ctgcgcaagc 13980 
ggtcagttct accctzccoc egggctattg teggecatag tcatgaaacc atccgccggg 14040 
ctttggagga tgecatcgag gatggtattg tgtcgagega cttcactacg gaggtcagag 14100 
gccagccatc ggtqttgcga atcttcaccg ggcagggggc gcagtggccg gggatgttaa 14160 
agaatctgat agaggcatcg ccatatgtgc ggaacatagt gagggagctg gacgactccc 14220 
tgeagagett geeggaaaaa taccggccct eg tggacget ■ actggaccag ttcatgetag 14280 
aaggagaggc ctccaacgcc caatatgeta ctttctccca gecattatge tgcgcggtgc 14340 
aaattgtcct ggtccgtctc ettgaagecg cgagaatacg attcaegget gttgttggac 14400 
atagctcegg egaaattget tgcgcctttg ctgeeggget catcagtgcc tcgttggcga 14460 
ttcggattgc ttacttacgt ggagtegtet eggcaggggg cgccagaggc acaceggqag 1452C 
ccatgttggc cgccgggatg tcctttgagg aagcacaaga gatctgegag ttggatgcct 14580 
ttgagggccg catctgegtg gctgccagca attccccaga cagtgtaact ttctctggcg 14640 
acgcgaacgc aattgatcac ctgaagggca Lgttggagga tgagtccact tttgegagae 14700 
tgctcaaggt egatacageg taccactcgc ateatatget tccatgtgca gacccatata 14760 
tgcaagccct agaagagtgt g'gttgtgctg ttgccgatgc aggttcccca geeggaagtg 14820 
taccctggta ttcgtccgtg gaegecgaga acaggcaaat ggcagcaaga gacgtgaccg 14880 
ccaagtactg gaaagataac ttagtatctc eggtgetatt ctcccacgca gtgcagcggg 14940 
cagtcgtcac geacaaggeg ctggatatcg ggattgaagt gggctgtcac ccagctctca 15000 
agageccatg cgtcgccacc atcaaggatg tcctatctgg ggttgacctg gcgtatacag 15060 
gttgcttgga gcgaggaaag aatgatctcg attcattctc tcgagcactg gcatatccct 15120 
gggaaaggtt tggtgcctcc agtttcgatg eggacgagtt catgcgtqca gccgcgcctg 15180 
atcggccctg tatgagtgtg tcgaagctcc taccggccta tccatgggac cgctctcgtc 15240 
gctactgggt ggaatcccga gcaactcgcc accatcttcg agggeccaag ccccatcctc 15300 
tattaggaaa gctctccgaa tacagcactc egctaagett ccagtggctg aattttgtgc 15360 
gcccacgaga cattgaatgg cttgatggac atgeattgea aggecagact gtcttecctg 15420 
cggccggcta tategtcatg gcaatggaag cagecttaat gattgetgge acccacgcaa 15480 
agcaggtcaa gttactggag atcttggata tgagcattga caaggeggtg atatttgacg 15540 
acgaaqacac cttggttgag ctcaacctga cagctgacgt gtctcgcaac geeggegaag 15600 
caggttcaat gaccaeaagc ttcaagatcg attcctgtct atcgaaggag ggtaacctat 15660 
ccctatcagc caagggecaa ctggccctaa cgatagaaga tgtcaatccc aggacgactt 15720 
ccgctagcga ccagcaccat cttcccccgc cagaagagga acatcctcat atgaaccgtg 15780 
tcaacatcaa tgetttctae cacgagctgg ggttgatggg gtacaactae agtaagaact 15840 
tccggcgtct ccataacatg caacgagcag" "atcttcgagc cagcggcacc uagacttca 15900 
ttcctctgat ggacgagggt aatggctgtc ctctcctgct gcatcctgca tcattggaca 15960 
tcgccttcca gactgtcatc ggegcatact cctccccagg tgatcggcgt craegctctc 16020 
tgtatgtacc cactcacgtt gategcatea cacttgtccc atccctttgc ergacaaegg 16080 
ctgagtcegg atgegagaac cttgccrtca atactatcaa tacqtacqac aaggcagacz 16140 
acntgagegg tgacattctg gtgtttgacg eggagcagae caccctgttc caggttaaaa 16200 
atatcacttt taagcccttt tcacccccgg atgettcaac tgaccatgcg atgtttgccc 16260 
gatggagctg gggtccgrtg actccggact cgctgctgga taacceggag tattgggeca 16320 
ccgcgcagga caaggaggcg 'attcctatta tegaaegcat cgtctacttc tatatccgat 16380 
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CCH ICCt CcC 
cCc lCOcC'C 
cCCCCGGC Z C 

a ccaccc tea 

coa eccocaa 

acacact cac 

ategctatea 

acgtcctggc 

gattcttcca 

ccctcgatat 

tea ttgeetc 

gctctctgct 

caegget egg 

getgeact ga 

1 1 tccggtgt 

ttagtaccca 

ccaccgtcaa 

agcgtctcct 

gee tcgtgga 

cggagctgga 

tgctgctcac 

ctcaccaggc 

gagttcatgt 

aggtgetteg 

aacccgaggt 

ctegcaataa 

gggctcccgt 

aaacctggt t 

ttagctgtcc 

acg tccagga 

ttg tgcacgt 

gt ggcagect 

ccgccaagtg 

ggcagatgtt 

ctggcaacag 

gcgacacgga 

tatcagcega 

gctgtgccca 

gcgga tegeg 

cccgccaggg 

cagcgcatgc 

eggacgagge 

ccgacaaaae 

gga tggtcca 

ccaag tggct 

aggagtcctt 

acgtgacaag 

caccagtggg 

tggaactgee 

acgagaagtt 

cgat tgtggc 

ttcaageget 

t ttgetaaac 

tgccgtqt ac 

teggt tea tg 

ggtggccggt 

cgaca tggct 

t cggatcacc 

ggecaaagea 

ggegacgaac 

a tattctccc 

tggactct cc 

caeca tccca 

gttct ccaag 

caccgatctc 

agccaccaac 



zcazzz :.acc 
cctccaacaa 
c c a c a a tea ti 
tcttcccctc 
ccca 1 1 cgac 
c z t cgcaccc 
gtcaatggat 
caccccccac 
gcaagcgccc 
ccgccgcag t 
caa tgtgcta 
caagect gga 
gtt tatct t t 
geegt t tote 
goacagtege 
tgcaa t tgac 
ggactct t ac 
gaaegata t a 
1 1 tgetagae 
cgaggaa t ta 
gt acgccagc 
gagcacga tc 
tctggacgtc 
get tgaggag 
ctcctggtgt 
ccqaa t gaac 
ggca t tacag 
t gtgectgag 
cca tgeget t 
gggcaa tege 
tcgtcccga t 
t a tggtaacc 
cctggggg t a 
gctccatgct 
gagt teggt t 
c t ggcacctg 
ccagagctgt 
ttaccgtgcg 
gca tcaagct 
act tcctagt 
cgacaagacg 
t aeget tec t 
a ta t ctcctg 
geatggggee 
ggcgcatgt t 
ccttctgcaa 
ccaaaactca 
gggta t tgee 
aatgatggag 
ctccga tccg 
cgtcatggqc 
ggcgcagcag 
actcct atgg 
gqcgt t gggt 
t tcgatt egg 
cgacgaqccg 
ga tctggaac 
ttct tcgacg 
ggcgaagggg 
ctggaccagg 
cttcctcact 
gcgaagctgc 
ctaa tcgatc 
cagctgtacc 
gctaa tgagg 
qqooo tacaq 



ctccaccacc 
c t cc t gecca 
actgaccc cc 
ettcagegag 
ct t ctgcacc 
ccactacact 
attctggaga 
ctcccc: c ca 
gagcaa 1 1 1 g 
cccgccgacc 
ca tgegacac 
ggecaga t gg 
ggtetgt t eg 
tegt t cgacc 
accaeggat c 
occaccgt gg 
cctccct tag 
aaagegacca 
geggaggage 
t tcgccgggc 
aatacggt ct 
ggcatgetae 
gaegeggt tg 
cataegga tg 
aaaggccgcc 
t cct cgcgcc 
aeggcteggg 
agtg t tcagc 
agggt eggae 
gaagtgcccg 
tata ta ta ta 
gtcc tegecg 
actgact caa 
ggtgaagaga 
tetge tggag 
egaegggtae 
gaaggt ttga 
gcagacctgt 
tcactgcccg 
gtcagegagg 
cgcccgga t c 
accagggt t c 
gtcggactga 
tgecaca ttg 
gaagaactgg 
ttcctcctta 
gtggaagctg 
t ttggccctc 
at ggtgctca 
accagtagca 
aacccgggtc 
egagttgeat 
gect t tacta 
tcgtcactcg 
t tgaggaaca 
tgcaccagca 
tgacaacegg 
acccccgca t 
cagccggctc 
teegtcagat 
cageggactg 
aggtgaccct 
agggggtgga 
ttgatt tgee 
ctgctgcgcg 
agagcac t ga 



CCCc c Cc CCC 

cccccaacca 
acattcacea 
teggecaaea 
acga tcccet 
acocccccca 
ttcgaccacc 
acagctaeac 
ccccctteca 
aeggcttega 
ccgacctaga 
t ta t tctgga 
ccgactcetg 
gctggga tgc 
gggacgcaaa 
agtac t taga 
tggtgg t agg 
tgcctcctcg 
tgccgatgaa 
tcactgaaca 
ggc t ga ca ga 
gctccatccg 
aaacct t cga 
agctggccag 
cgcggacicc 
gtcccatata 
attcatcatc 
aga t gga aac 
agetegggt t 
teg tggcec t 
ctgaggcaga 
eggeggtc t 1 
tectegt t ct 
teggtet tea 
aegecaagt c 
tgccccgggg 
ctcagagga t 
tcacagacac 
cegcatat tg 
ggtgggacgt 
t ct cgacagt 
get cca t cga 
ctggaga t ct 
tact tacgag 
gtggtcgact 
tgat cccgac 
gectgget aa 
tggt tetgea 
accccaaggt 
accctctcga 
aggctaacta 
ceggatt age 
aacegggcag 
ggeggagct g 
tgaactgea t 
agagcagcag 
aat tccgccc 
aggcaact ta 
caagggctcg 
cgtcatcggt 
gaga ttaacc 
gcaga tcccc 
ct ctctgcgc 
actcctgaaa 
at tgccacct 
caa tact tec 



acccctccat 
ccctco tcae 
cc tttctaec 
cc tgctcccc 
cctcacacac 
attcctgccc 
gaccggccgc 
atacacccai 
ggaceggat g 
gecgea tgee 
gaaaaccatg 
ga t zacccac 
ggctgggg tg 
ga tcctaaag 
tctat tcccc 
cgcgcccc 1 1 
agggcagacc 
tecget ccag 
gtccacgt 1 1 
gacct z cgag 
aaa tgcctgg 
cegggagea t 
tgeaaect tc 
t tcaactaca 
tegtctgaag 
eg aga tgat c 
ctact tcttg 
aaagacgat c 
tttctatctt 
ageagageg t 
taacaatctg 
ggcggagacg 
gaatcccccc 
agt tea t ct g 
ctggctaaca 
tgtccagqct 
gatgaaagt t 
cgtttccact 
ggagca tgtg 
ga tgccgtgc 
tatt tcctgg 
cget gagace 
tggacgat ca 
cagaaa t eeg 
cactgt t ct t 
taaegcaget 
actcaagga t 
ggatgtga tg 
egaaggegtc 
ettcttegtg 
cagtgegget 
agtacgtt tt 
gcgt ccacca 
gaggaggact 
acactgtt tg 
eggaagt t eg 
ctggatccag 
aaaattcegg 
gtcaaagaac 
aagt tgagcg 
get tctt ttc 
ga t ggggaaa 
gcggt caccg 
gtgcttgggg 
agctcca t tc 
gagaa tgaag 



ttgeagaace 
ct ategtacg 
cctaactcct 
accc t acga t 
1 1 ctatacca 
c a c a t coccc 
cc taecaagt 
a t e t cca ccg 
ct ct t tgaac 
ta tgat ctga 
gctcacgccc 
aaagaacaca 
gatgatggtc 
eg tgtcgggt 
acct ctgt gt 
cccagcagcg 
ccccaa tctc 
aca tacaagc 
gt catgetea 
ccaaccaage 
ct ccaacatc 
cctgacttgg 
ctggt tgaac 
tggactcaag 
cgcgatctgg 
gatt egtege 
gagt ccgctg 
ta tgt ccact 
gtgcagggtc 
aacgea tcca 
t ctgagggtg 
gtgat cagta 
agcat a tgtg 
gccaccact t 
t tgea tgetc 
t tagt cgact 
ctgat gectg 
gaat tgcata 
gtatcct tag 
actcaatttg 
ccccgggagt 
ctct t tgegg 
etaggtegt t 
caggt gaacc 
t cca tgtaag 
ggct t caggg 
ctgea tetge 
ctaaa taata 
cgea tectge 
atgttctcct 
aactgctacc 
cactccatcc 
tegacategg 
ttaatgcaa t 
ctgaggcagt 
egacagtget 
ccct caaaga 
agtaccgagg 
agctct tgea 
aa teegggga 
ct t tggcaga 
gcg tgea tec 
tgggaacctg 
gtget tcga t 
ccctcgtcgc 
t t tegggacg 



1644C: 

1 c r 6 ' 
i £62 C 

1 6 6 £ 1 

i'esc: 

:ce£C 
1692C 
I696C 
I704C 
17100 
1716C 
17220 
17280 
17340 

13^00 

2 . hc: 
17520 

i*5e: 

1 7 64 0 

17700 

1776G 

17820 

17880 

1794C 

18000 

16060 

16120 

1616C 

16240 

1S300 

16360 

13 4 2 0 

18480 

1S540 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

190SC 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

2C220 

20280 

20340 
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ccaqcazacz gacczzaczc cccccccca: cazcaczcac ccczcczczc c::; z zaaca 1C4CC 
ccatacgcac ccccgccacc aggacciccc occttcccac ca:ccac;c: ::::::ccca 2 04 60 
acaa zaczcc zggagaazcc aacacccac; cgaagacccc accc:ct::a acaacaccai 2 Ob 2 C 
tggtatgttc atgaacgcct czazzgaccz caaacocccc tacaacgcgt: cgagagcggt 20580 
cr igcaccgc cacaacarci rccgcacggg ere tcccaac o:cca cgaca acccrratggc 2064 0 
ccagctggtg trtggtcaaa ccaaaaacaa agtccagacc a z ccaacz c c ctcaccgagc 20700 
cggccccgaa gagggctacc gacsactggc ocagacaccc rataaccccc ccccaggaga 20760 
caccttgcgg ctggtggacL tc^tcucggg ccaccacgac caictgctgg ntczggctta 20820 
ccaccgac t c etcgeggatg gatciaciac aaagaaca ic zzee t cga a c cgcgccagct 20880 
ctacgacgcc aegtegctaa ceccacacQi ccctcacttt ccgcacccgg cgccaccgca 20940 
acgcgcaatc cccgagcacg ggagaatgga ggaggatccc ccgtactgga agaaaatgea 21000 
ttaccgaccg zcczcaazzc cagzgczccc accgatgegg cccctggtac ctaacsataa 21060 
cagot ccca z aczecaaazz rccaccaccc ;ggaccc:cc caccagcacc aac::::goc 2112C 
gcgact tgan ccgatggtgc ccctccgcat caaggagege agccgcaacc acaa::caac 211&G 
cccgatgcag ttctanctgg eggegtatea ggtgctgttg gcgcgccica ccca:agcac 21240 
cgatctcacc gtgggccccg ccgacaccaa ccgtgcgacr cccgacgaga tcgcccccat 213C0 
ggggttct cc gccaaccccc cccccctgcg ettcegggat ticcgccccc ata:aacgtt 21360 
tggegagcac cttatcgcca cccgtgacct ggtgcgtgag gccttgcagc acgcccgcgt 21420 
gccctacggc gtcctcctcg atcaactggg gctggaggtc ccggtcccga ccaccaatca 21480 
acctgcgcct tegttccagg ccgtcttcga ttacaagcag ggecaggegg aaag:ggaac 21540 
gattgggggt gecaagataa ccgaggtgat tgccacgcgc gagcgcaccc cttacgatgt 21600 
cgtgctggag atgtcggatg azcccaccaa ggatccgctg ctcacggcca agttacagag 21660 
ttcccgctac gaggctcacc accctcaagc cttcttggag agctacatgt ccctrctctc 21720 
tatgttctcg atgaatcccg ccctgaagct ggcatgatgg cgcaaacata gaacacgata 21780 
gegcagcagg gacgatgtag atagagcttt gettctgegg gtggatctat aata:agtat 21840 
atataaatat ggtgagccga acgaagaggg gggaatgeca caattattta etc::: zqcq 21900 
ccgtacacga ggagaagacg tccagaacaa cataaatata tcactctagt gagacaccat 21960 
atatteggag agactataaa aatatacatc tactccaatg tctgggccgt cacacacagc 22020 
ttacgaaaac gattaatgac ctccaacacg tcgcgcggtc gattgggaaa ctga:gctgc 22080 
ccagcaaact ccaatacctg cgcctctcgg ggggagaaat ggcgcgccac cagcatcttc 22140 
gatcctgega gegcaaaate atcgcgaccc tgcagatgta atgtcggtat ccgaatgacc 22200 
agttcctcct gccactcggt atetttgetg tcgttgtcgt cgtcatggtt cttcatcact 22260 
tcgttcctca tatactggct tgcctcgtct tgataccagg gacagatcaa cagcgcaaca 22320 
ctcatccggg gcaaccaggg caggtgaccc atetgetget gecagaggag caaggtegtc 22380 
accagggcac etteggagaa accgatagca cccacgatag ggatgtgggg g tgt tgagtc 22440 
tgecagtega caatggtgcg gcggatgggg tcgtggacgg eggegaggeg ttccctcacg 22500 
gagggtccat tatgattgtt gtcgctgctg ctttcaaacc aggagtaata tggecctagg 22560 
teggegaaga eggggagaat cccaggccct gcagaggaag ggaaeggage tgtcaegtag 22620 
acgaattcaa agttttcgcg cagcgcagcc eggagcttag acaattgeae gcggaagatg 22680 
gegggagage agccagcccc atggatgcaa agaagegetc gaggegcett caccagcgct 22740 
ggagatgett ggtaaegcat ggaggagggt acaatgggac tatatcctgg atgcaagacg 22800 
gggatgaggg agtgtcgagc ttacacgttc accagegatg aacegctatt atcccaacgg 22860 
aatatcctgt ctaacactct gcatctaccc caggtggacg ggacaageca gegargcega 22920 
ttacatccat aggaacagca gtttgttcgg aacttccgcc tcaccccctc gatateggge 22980 
geaggatgee ggcgccgaat aacgccgcac aaacccgaac gggtctgcag gtga:cccga 23040 
agccctaatc caaggatcgt ccgtccttct gtctatgtct ttcegcatat gtaggecgea 23100 
gegtaccaga tacgtcactc aacagttaac cagagaagac gaeegtgaca gaetgecatg 23160 
ggcgaccagc cattcattcc accaccgcag caaacagcgc tgacggtaaa tgaccatgat 23220 
gaagtcaccg tctggaatgc cgcaccctgc cccatgctgc cccgcgacca ggtatacgtc 23280 
cgegtcgagg ccgtggcgac caatcccagt gacacgaaga tgegeggaca gtttgccacg 23340 
ccctgggcgt ttcteggaac ggactatgee ggcaeggteg tcgcagtggg ttcggacgtg 23400 
actcatatcc aagtgggtga ccgggtctac ggggcacaga acgagatgtg cccacgcacc 234 60 
ceggatcagg gggcattctc gcagtacacg gtcacgegag gccgtgtttg ggecaagate 23520 
cccaagggct tgtcgttcga gcaggctgcc gcgctacctg egggcatcag tacccctgga 23580 
ttggcgatga agttgcttgg getgectttg ccatcgcctt cggcagacca gccacccacc 23640 
cactccaagc cggtgtatgt gttggtctat gggggcagta cggccactgc cacigtcact 23700 
atgcaaatgc tccgcctgta atgcttccct tgtcctgaga cttttctctc cgttggtcgt 23760 
gggctgtaca agcgatggtt atactaagat ccgctggcag gtceggatat attccaattg 23820 
caacatgctc cccccacaat ttcgacctgg ccaaatcgcg eggegcagag gaggectttg 23880 
actateggge cccgaatccc gegcagaega tcgtcagtga acccctgcca ccgcrctacc 23940 
cctcccagtc cactttggcc ttacagaaca gactattgat autcttctag cgtacciaca 24000 
ccaagaacaa cctccgccat gccctcgact qtatcaccaa cgrcgagtcc accacattct 24060 
gcttcgcagc catcggccgc geggggggge actacgtctc cctgaacccg ttccctgaac 24120 
acgcggccac gcgcaagatg gtcacgaccg actggaccct ggggccgacc atctciggcg 24180 
agggatcaac ctggcccgcc ccctatgggc gtcccggcag tgaggaagag eggcagtteg 24240 
gegaggatet gtggcgcatc geggggcage tegtcgaaga tggacgcctc gtccaicatc 24300 
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cct tgcccgt 

acggagacct 

ccgtcacgtc 

aaacgtacat 

gat t t taaga 

ttgctct ctg 

acatccacgi 

ta tatgt tag 

ta tea tcccc 

tgctcgaa t c 

aacgeaaggg 

ggctgga tea 

acaatgt ttg 

agatcct cca 

ggcccaccgt 

cgegget cga 

tctcgcttca- 

tggccgccga 

cgcccatccg 

tgetgeaget 

tactget cca 

gegcegtaga 

a tgecaaact 

tggaggaaga 

ccccgacgct 

gcgctcaaa t 

atgeatt gta 

gtgtcgacct 

aagcagcgcg 

ttgtaattgt 

tgaccgcccc 

caaccggatc 

ttttcccttt 

gaaagaaaa t 

eta tccaagg 

taagaccgac 

ctgtattgta 

cagctgtatc 

aaggtaggta 

tctcccgcat 

ceggagatag 

tgactgtggc 

caagagccgc 

agcggcgcgc 

acatggctcc 

gcatcgagac 

acccacgtga 

tt tcatggtc 

cagaaacacc 

catggcactc 

ctt caggacc 

gcggccgccg 

ggtaaccgta 

aacgccct tc 

gtagagatag 

cgcgt tgaag 

ttcagctgea 

tttgagcatg 

cgcaacgagc 

catgggtccg 

geca tgcaca 

gcagaggt tc 

egtaaatgee 

cttctctaaa 

cgtcagagat 

acagacttca 



ggtgcagcgc 
o t eggegga c 
gaeggagcaa 
gggatattgc 
a tacat tgaa 
ggaacatccc 
catataccta 
caagct cat t 
geca t ggct c 
t gcgtgtcac 
cgcacacccc 
aca t tgaacc 
ggccaccccc 
aggega t ga t 
agttgata tc 
gggcaggc tg 
acagegag t g 
agcactcctc 
ccgagt tgcg 
taaaggtca t 
ggtcgaggee 
t ccactccgc 
t ttctgeege 
cgt acgacag 
ct cgcaa t ct 
ccggaagcag 
gggccatga t 
gtageggegg 
tataa tttag 
cgcaaagggg 
ggggatctgc 
cgctgccgca 
t tactaat aa 
egggeget ta 
gcgatcat eg 
teggtgact t 
aaagctacct 
t tacgtacag 
ct t ttctacg 
gttctaagat 
cgt cct ttga 
acccgcact g 
gcaggtact t 
a t tggegeca 
ctgggcctca 
agcgatgtgc 
atatatat aa 
gtaatgeaac 
gectggaega 
ccgaccaggt 
tgtcggacag 
atccagttaa 
tgcatgtcgg 
gaaatcata t 
atgcccgcct 
gcggccgcca 
cgttgggccg 
gtagtggcac 
acgccctcgg 
gccactccca 
agetegtagt 
tgagtggagg 
tccagccggg 
ctgtaccccg 
tctgcttggt 
ggggcagtgg 



ccc t tcca t c 
aaact cgt eg 
caaacctcca 
tccgtgga 1 1 
caca tactca 
tgt ccccagt 
t aa c t a t cgc 
c t c 1 1 ctaca 
c tatgeggat 
a t cacoacag 
ccctt tgggc 
tgacat aaac 
aaaggt caag 

CCCCCCC5GC 

tgggctggcg 

aaacatcaag 

aagcacct tc 

tccatcggcc 

ctgcgt t tgg 

ct cggtgatg 

ggtggcccgc 

cccagggtcg 

ctggaggtgg 

tcccgatgt a 

tgcgt ttccc 

cctatccacc 

cgtggtcagc 

cagctgattg 

at tgectagg 

tccga ttggg 

ctgga 1 1 tea 

gcagca tcaa 

a taaa taga t 

ttt tgtctgc 

ga t egggecg 

teggt gttat 

aacctcact t 

gtccgtgata 

taggtaggta 

taccacatat 

agctgccccg 

cga ttgtgtt 

aactcgccag 

ggctccgtgg 

agaa tggtgc 

agg tt tccat 

cegggaggag 

tactgctctc 

ggt tggaggc 

tct tggccgt 

tctccgcggg 

tggcggcgga 

gaaatccacc 

tcatgcccat 

gggccaccgt 

gaacctcgtg 

aegggaateg 

cgccgccagc 

t tgtggcca t 

ggggcagagg 

ta tagtcgeg 

gcgtt ttcga 

tcaccgagcg 

cgatcttacc 

tcgatttgag 

cagagcatgg 



acattaacca 

t tegect cca 

ctttttctat 

ggga t aaa t c 

gacaactctt 

atatcataoa 

tatatatata 

aaact taa t a 

taaccctgct 

acegggtcat 

tcgat ttget 

caaaggaat c 

gaacctt t tc 

ccaaagct gc 

tccat gtget 

tccacggtct 

a ta taggacc 

eggaagtaga 

t cggcccggc 

cccagcggcg 

tcgacgagct 

t tgacggccg 

ccctgggcca 

tgegtcagea 

gcgtcgtcaa 

gtctcatcca 

agct tggtcg 

cactcgtccc 

tatgtcccgc 

gtctctcacc 

cggccttgcg 

tga tgga t cc 

gaaaataggg 

ggctggggcc 

gcgga tegt t 

gcgccgctag 

aggtaggtag 

ctactacctt 

tttttatatt 

accggtgt tc 

agaatcaatc 

gca ta tgggc 

ccagcacggc 

aa tgtgcgcc 

ctccgccaa t 

cga tgctaga 

aaaa tcaaaa 

cacattctgc 

atgggcgttg 

gt tgagctcg 

gat cgtggct 

tttcttgtcg 

ctcggcggcc 

cgcgtcgccc 

cagggtttgg 

cccgagagga 

cagacaggga 

a ttgatcget 

gggaa tgaac 

caggtacccg 

a taagggagc 

cacagcggca 

ggatga tccc 

ccggagacac 

gagggectet 

caccgacaag 



aggca tggag 

cggcccct aa 

caaccaatcc 

acgagtga ta 

ta t aaact ga 

t taagagaaa 

gat a tata ta 

a t ccaaa t ac 

ggtac tgege 

t ccagggt tc 

aat tacaata 

taggagcega 

ggcgccagt t 

gacgaaggac 

ggttca tctg 

gtggctgcag 

cagggcccga 

ccgagtcg tc 

gegcaagcat 

cacaga tat t 

tacccgccca 

gcggcgccag 

tgtattcccg 

ggtgccgcag 

acccctccag 

agtccacgag 

cactggegag 

gtcgcaccgt 

tgtca t tegt 

actgeaatet 

gaaggcggt t 

ca ttctgaat 

aataaaaata 

a tagategga 

aca teaggee 

gaaeggtat t 

gtaggtacct 

acctacctaa 

egatttgeae 

taattgetec 

acagegat cc 

at tgaccaag 

cgcggcgaca 

ccggacacct 

cgtgccgacc 

aacccaccat 

cgtagagaag 

geegga tcct 

aagccgccca 

accagcgcat 

teggegatga 

gaacagaaat 

atcgcctcca 

gtggt ggtgc 

agccgcgcaa 

gact caaccc 

ccacgcgtca 

t tgcatccgc 

aacgcctgtc 

accacgttct 

aatgactccg 

cggcgaatcc 

gccgcaatcc 

agtt ccacca 

gtttcttctg 

taggageca t 



ctegt cccca 

acucgat: gc 

gtagaegcta 

cacagggtgg 

tat acactac 

aaaaaaaaaa 

tatatcta ta 

aaa t agecaa 

a taga tgeca 

cag ttggaag 

gcact tagta 

get tacccac 

ct ctccgtcc 

catgggca tc 

ctct t cgact 

caggagcccg 

gaacaccccc 

gtagegcagg 

atccggccgt 

ctcctgcagg 

gtccaggttg 

gcgactctgg 

gagcaaegga 

cgtga tct tc 

caegggea tc 

accgcgctcc 

ccggcagggg 

ccgagccccg 

ctcggtgtgc 

egggeca tga 

tccatcagaa 

tttttaattt 

aaaccaaaaa 

cct tacctac 

ggtccccgaa 

gtcttagact 

aggtactctc 

gttaca tgac 

gaaatggatg 

gtactctgta 

cttcatttct 

t gacccgccg 

a tgegggeca 

agcaagtcca 

tccat cgagg 

tagcaaacac 

ctacctactt 

gaccagtggc 

ggctgcccgc 

egaegteggt 

cggacttgcc 

tgccagataa 

gggct 1 1 1 tc 

ggaaccgga t 

accggctggt 

agegctggge 

tacca tcacc 

gaetggeget 

catcgatcac 

cacagcaggc 

ccaggccgct 

gcacggcacg 

gctcgagagt 

gctcgtcgtc 

gggggcgagt 
cagggaegge 



24360 

24420 

24480 

24540 

24600 

24 660 

24720 

24780 

24840 

24900 

24 960 

2502C 

25080 

25140 

25200 

25260 

25320 

25380 

25440 

25500 

25560 

25620 

25680 

25740 

25800 

25860 

25920 

25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 

28140 

28200 

28260 
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gcactctctc cczzqqcccc aaccacgcca ccgagcaccc zqqazcaqaz gctngtLgag 2832G 
gaccagccia acgaccaatg cgcccatgac acaiagtcoa cccagccggc rttccaatcc 2838C 
atcgagcaca ccczcaaazc taceacct:i agcggc:ca: aractcaacc cagcrgattc 28440 
caagacgtac tgcagtggcg gcagaacggt tacgcgtgrc tcaaacccct cgacgcgcgc 28500 
cgtcacgtac acctcgtiica ctccgttggc ccctictucc aaaggactga cagcagtagg 28560 
gogcagacca ccazzaqcca tataucccc cacgacccga cagaagaagg gcgacaattg 28620 
caggaagtga cacaqaacga agccgcccac cczcaazacc ttccaccagg rgatgtttgt 28680 
ccactzqacc zzgzqcccaa eaatccccga acgaccct z c gcatcaactt gaccaagccc 28740 
zcczqcgczc cqqazzcqcq tgatccctaa z z tcacccaa agaatcgtgg catagaaggt 28800 
aaaaagtaga acggcatcaa tcaggagtgt ccatgccccc aggaagcaaa aatggccaag 28860 
actatccttt ggccgaagga ccgcccctaa tgccaacgcg ccgatttcca ggaggtaaga 28920 
ccgcacaata taccacccct ctcgatcaac cgcgagttgg atgatgttag gaatcaattg 28980 
gttttgccgg ctatcatcac tgctggcgcc attgggaact tgccgctgcc ccccgcccca 29040 
cagttcttcc gacacgcaga gaacagcacg ggttagttgg a tcggct tct caaagcccac 29200 
tgtcagaacg aggtacggga ttccttcgaa aagaagaagc atgtcgacgg gcacgtcgca 29160 
tgtagtcgtg attccaagcc cgagtacaaa agcaaaggca ccagacagca ggaccgaggc 29220 
zqccaaccag aagcgtgagc cgaggtggcg catcacccga aagagagaga ccacagtcat 29280 
attcatggct aggtaactga gccctatgat caccaagtcg actgtctccg catggtgaac 29340 
acgatgaagg aatgacagcc atgagcttcc gagccagtgt cccagtgacc tgggacttcc 29400 
ttcttcgcga ggagacctca gtctccaact atcatcctct t tt tccgagg gtataatttc 29460 
gacagcctgt agaaagccat ctagttggga gtagggaact cgaaacgtca acgcggaatg 29520 
atcgggtgag ggggatgaat cgggggtggg cgtggtctgt tccgcaccga ggaaccctga 29580 
gagcgtgttt gataggaagg ggatactggc gtcgacagac gcacccggta gatcgagggt 29640 
gacaagagcc cagtgaaagt cagactaatt gagagtagca cagggacggt ccggtcagca 29700 
cgcagaattt ctcggaccaa tggagcgcag gagatctatc atacctggcc atcctccggc 29760 
accttaggtc ggtcgtctat ctgccatttc catgcgctag cctctcccag gctaacagtg 29820 
cggcttcccc agagaaatga ctggacgttc aagggggttt ccttggccca gggggccaga 29880 
cctctgtgag tagcctgata tgtcccttcc agcacatgaa gatgggtcgt tgccgcggtc 29940 
agcgccgtga ctagcagagt gtgaatggga tgtcgacacg cgtgtcccac aatggcacgc 30000 
aatgctttgg tcactcgatg ctgcaccccg ccagggtccg gctttctaac caccggatcc 30060 
attgcgatga ctctcttggt gcgcgagaga ggggtgtgag tgcttgtgga tgatgtaaag 30120 
acaagtacaa gattcgcaga aaggtcaaag agactagaaa aaaataaaat aaaaaaatta 30180 
aaataaaatt aaaaaaaacc cctaaaqaac aagaaaagaa gagagaaaga aagagagaga 30240 
gagagagaac caagcctcag ggaaggaagg aaataagcca aggcaactcg cttggtgcga 30300 
cccaatgcta ggaggtccat gagactccgg tacttcctgg taatattgag ccaaatatac 30360 
ttattctagg taggcggtag ggattttaat gcgtatgcct atgatacaat aatggtacat 30420 
gaaggcgact gcacaaacaa tcttacgccg tttggctcgc tggctagaag agtaagcact 30480 
aaacaaggcc agggttgggg aaatcttctc cactgtcaga cgtccttcgt accttagagg 30540 
tactacctta ccgaggtaag gtactttatc tgaacacgga attacatcct cttaccacga 30600 
attttcagat ctcccaattt tgtaggttat gaaatattga aatttattga cctgcccccg 30660 
tccattggct aggtattaca cacggctgta tctctatctc atagaatggt atctagtacg 30720 
ctcttttcga tattacaaag gtggcttttt caggaaattt ggaaatatcc agtgggaaga 30780 
gggtgtacgt actctagtag gtaaggtacc ttacctacct taccttctgg agtcctggta 30840 
agattgtttg tgtaatgaat cgatcagctt agtgcgctaa aattcggtca ccctgtagaa 30900 
acataacaat atttcctttc ttcagcccct acctaaatta ggtgttcaaa gccaaggagt 30960 
aatacatacc gtcagtattt ctaggttggc aggagggatt ttgatgctgt cacccttggc 31020 
gcatcatgca cgattcatga aacagactgc actatccgtc ttacgcgcct tagcgccgcg 31080 
gctgaacgag taagcactca gccaaggagc ccaccccctc ccggccccac cccctggcag 31140 
ggctgtcaaa gcggatcgag tctcattggg tttgcttatc aaaatcggtt ggtaatccag 31200 
gtagcagtag catctggatg taaactaata aaaaggcaga ctgtcgggac cttaaaaggc 31260 
caaagaggta cccatgcccg ggttcttcgg tcgccagaac tgcaatgtat caaatgtctg 31320 
atgctcatga cccaggacac agcacagtac aggggggcaa tggctgcaga tcaaggtata 31380 
ttcacgaact cggtcactct ctcgccagtg gagggttcac gcaccggtgg aacattaccc 31440 
cgccgtgcat tccgacgctc ttgtgatcgg tgtcatgcac aaaagatcaa atgtactgga 31500 
aataaggagg ttactggccg tgctccctgt cagcgttgcc agcaggctgg acttcgatgc 31560 
gtctacagtg agcgatgccc caagcgcaag ctacgccaat ccagggcagc ggatctcgtc 31620 
tctgctgacc cagatccctg cttgcacatg tcctcgcccc cagtgccctc acagagcttg 31680 
ccgctagacg tatccgagtc gcattcctca aatacctccc ggcaatttct tgatccaccg 31740 
gacagctacg actggtcgtg gacctcgatt ggcactgacg aggctattga cactgactgc 31800 
tgggggctgt cccaatgtga tggaggcttc agctgtcagt tagagccaac gctgccggat 31860 
ctaccttcgc ccttcgagtc tacggttgaa aaagctccgt tgccaccggt atcgagcgac 31920 
attgctcgtg cggccagtgc gcaacgagag cttttcgatg acctgtcggc ggtgtcgcag 31980 
gaactggaag agatccttct ggccgtgacg gtagaatggc cgaagcagga aatctggacc 32040 
cgtgcgtcgc cqcattcccc aactgcttcc cgtgagagga tagcacagcg ccgacaaaac 32100 
gtatgggcaa accggctaac agacttgcat atgttctcac tagatcccat cggaatgttt 32160 
ttcaatgcgt cacgacggct tcttactgtc ctgcgccaac aagcgcaggc cgactgccat 32220 
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caeaacacac 
aiiatcaaatg 
acccaoccCa 
gacaccaccc 
crcccta; z c 
cgcactacct 
caccccac ".'C 
gcccCQcCcC 
cgcct ttict 
tcccgagg t c 
cgcaaacaca 
gcctt tggaa 
tctcagtggg 



tagacgaa tg 
tgcgga ttc: 
gccata tgag 
gcagcagcgg 
ccgagctc: i 
tacargt: tag 
agggca;;gc 
cggcgaccaa 
t ca t g 1 1 c 1 1 
gaacca t cgc 
aaca tggca t 
gtgtgtgcaa 
gtgggaaga t 



z t cacGcacc 
gaccgccata 

CCCcClCCcc 

ccacagcagt 
ctccta zzzz 
cgtacaa t tg 
agcticcacc 
t 1 ccgcaaga 
gagcca cgaa 
agcactgcga 
gcicagagac 
ccact gcgta 
a taggaaa tc 



aagaaccuct 
tcggagu tgc 
ggcagicga t 
c z zgacacca 
caccccctca 
c:gcgcgaga 

cCCctGcCCG 

cgcgaggagc 
ggggct t z cc 
cgatgctatg 
ctcaacaa ta 
gcgccctgtc 
aaacatcacc 



t tacggcagt 
ccctgicgca 
cccactcgcc 
tacccttctt 
cacacgccct 
aigagat tac 
gggaaccagg 
agccgaccac 
aggaggca aa 
aggatatctt 
t tcct cca tg 
at tcggtgcc 
acatcaactc 



acacigt zac 
aattaggcgc 
gage agagac 
t a gc ga ga a c 
azzczcqccz 
t ctgggagta 
cgagga ta ta 
cccagcggc t 
g icigct ggt 
1 1 ccctcgcc 
aaccaa t cca 
ggacagagtc 
t zacacccac 



32280 
22340 
22400 
324 6C 
22520 
2 2 58 0 
22640 
32700 
22760 
22820 
3286C 
2 2 94 0 
23000 



<210> 19 
<211> 31328 
<212> DNA 

<213> Aspergillus terreus 
<400> 19 

ctaccacgaa acttcagatt ttcctatctc aeggecttta gaatactgac ateegttgat 60 
ctttccctat tttctgtgag gtagtaagtg tggcagtatc tccctcctat attnggcagg I2C 
ctcttttcaa tattatatat agaagtggcc ttttcttaga aattggaatt tectggegag 180 
agggcgtc§t ctgagactct agtcacttcc aactattgea gctacatgga ggagtacctg 240 
ttgactaagt tgtactctgt gatttgtttg ataaacactc aataggtttc tagcaegtre 200 
cagttgtagc tgagcactca actagctaac gggtgagaac attgaagegt caattctttt 360 
tacceggaat acttttaaca gctatgtatg gttatttggg aceggggaag gaattaaatc 420 
aegcaggget tcttcttcac tcccatgcga cctactcaca ttgtgttcaa aagtaeggae 480 
gtgtctggat caatggctcc gaeagaegge caagaaaacc aatgcatttg ccaagaatac 540 
aaatggctac gtactagtcg agtagggege ggtcctgett gccgcctcca aaggaaaacg 600 
ggcatttact agatttgttt gttqcaggta ttgtactcgg atagtccagc ccagggatca 660 
attgactttg agacaagcag cgggccccgg cggccaatcg gcttttaagg egggtagggg 720 
tcgggtggtc tactgtatgc ggttcctccc gatcaccaca ttaegtctta cattgeatae 780 
ctaggtaaat cttatggtac tgtaggtata cacaatattg gtgaaagtta ccggtaaaaa 840 
gaaaatggaa aaactagaaa ggcagaaatt ttaccggtaa gataagtgaa t ttcgtttta 900 
ttegcaggge atgatggtat gataggtaca tttacgaaaa cttgaactca tgataatacg 960 
gtaccacaga cagcttaggc gcccctgcat cccctttgta ctctcgtact ccatatagaa 1020 
aagcttgtat gtttggctgg acctgctgcc aegagtegge ctggtccgtc ttactacttg 1080 
tatatcagag ccgctttgta aggttttggt tcgcctcggt tttgegatte catatggggg 1140 
cttttagggt aeggaaatec atgtaaacaa taccagctga agtacctact aggtgccacc 1200 
taggegctaa ggtacctaag tacatcttca tcccaaaaac aaagataaag gagttctaaa 1260 
ttacgttaga ttccgcattg ctgcagcgac acaegtgatt ttaggcttcc catcaccagc 1320 
aagctcacct ccaggaggcc cctttccctt ggacctgact aaggcatcgt gttcgccggc 138C 
aaccttacca taaggtagaa tgacatccca ccacggtgaa acagagaagc cacagagcaa 1440 
cacggctcaa atgcagataa atcatgtcac tggectcagg etaggectgg ttgrggtttc 1500 
agtcactctg gtggcgtttc tgatgetett ggatatgtcc atcattgtca eggtcagcat 1560 
ggcaccagcc tggagattgc tccgagcctt ggagacaact gactcttcac attegcagge 1620 
gattcctcac attaccgccc agtttcattc ectgggegat gtcggatggt acggaagtgc 1680 
gtatcttcta tcaaggtgat cgattttcca acccatgccc tcttcctttt ctccagccgg 1740 
gtttctattg actccacgac acgctctagc tgtgccctcc aacccttgac aggcaaacta 1800 
tacactctgt tgaccctgaa atacaccttc ctcgcttttc tcgggttgtt tgaaattaga I860 
tcggttcttt gcggcactgc tegttegtea accatgttga ttgtagggcg agcagcggcc 1320 
ggaatgggag ggtegggget caccaatggc gcaatcacca ttctgtcggc ggcagctcca 1980 
aagcaacagc aaccgegtaa gtactgatag ccagacctat ctcaaccgtt gttatgetat 2040 
gctgacccgg atatttacac atagtcttga ttgggatcat gatgggccgt cagttcgcca 2100 
acccattggg atccccggaa atcatcaagc atagtttctg actccattcc cagtaageca 2160 
aatcgecatt gtatgtggac cgttgcttgg gggtgctttc acgcagcaca caagttggcg 2220 
gtggtgtatg tatccccatt ggatttatcg gttcagtgct tgetttctea aaggaccttg 2280 
gctacgactc cgccacgtca agatctttcg ctcacggtga ttctggtcca ggtttttaca 2340 
tcaaccttcc cattggggcg tttgecacat ttctccttct cgtcatccag atccccaaca 2400 
gattgecate caegteggat tcaaccacag aeggcacaaa ccccaagaga agaggggctc 24 60 
gggaegtett gacccaactg gatttccttg gattegtget cttcgccggt tttgegatea 2520 
tgatatctct tgctttggag tggggtgggt ctgattatgc gtgaaatagt teegtgatea 2580 
tcggcttgtt ctgtgcggcg ggcgtgtcgc tggtgctgtt eggatgetgg aaacggcatg 2640 
tcggcggtgc agtggccatg attcccattt ccgtggccag tcgtcgccaa ctctqgtgct 2700 
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cctgcttctt cctcggcttt tc::ccGccc ccctacttat tttctcctac zaccz ccct a 276C 
tcracttcca ggccgtcaac 5a:cr:tctc ccaccatcac tcgagtgeeL a :ccicccgq 2G2C 
gcattgctgg acaga tcci c atcgcaae 10 tgacggotgc aazcaqzzga g z ccccacca 2880 
ttccaccacc tttcttcccc tataacc:at ggcgctacig acaaatzgag gctggtagtc 2 94 0 
ggtaaaacag gclatiaccz LcccicggcQ cicgcaagcg agaLCcirgt gtccatatcc 3000 
gccggactgg taccgacctt ccagccggaa 5cc:coa: c g cagcatgggt catccaicag 3060 
ttcctgggag ccg i gggccc agga :ccgga acgcaaaccc taggtgacct ggazcgtttc 3120 
catcggtttg cgccgcactc tcatccaaat gctca ttgac tcggctgtcc ctcci t tagc 3180 
ctgtcgtcgc cattcaaaat gcgctgcctc cacaaacgag ccccatcggc atttcgctag 3240 
ccatgttcgg ccagacattc ggtggctcgc tttttctcac cctgaccgaa ttccutttta 3300 
gcaatggttt ggactctggt ctgcgccaat atgcgccaac cctcaatgca cacgaggtaa 3360 
cagccgcagg ggccaccggc ttccgccaag tggtccccgc ccctctcatc tctcgggtcc 3420 
tcttagcata cagtaaaggc gtggaccatg cat tctacgt tgcggtcggt gccuctggag 3480 
ctacctccat cttcgcctgg ggnatgggcc ggcctgcctg gagaggctgg cgcatgcagg 3540 
agaaaggacg gagcgaatga atttaatcct atcgtaggga acgccaaaga aatamaata 3600 
tttctatgga gatagccatg taccgtttgg caactcatac actctaccta ctcttctcaa 3660 
accaaacgaa ccatttttat gacaggaaga gcaaiaatca atcacgaacc agttgtgacg 3720 
acagacccgt tcaactgtgg cctttttttg gcgcttcatg ggtaaacact tcagtgttga 3780 
tgagattcct agcttgacct gagcagacct aaacctgcat tacctagtca tggaaatatg 3840 
cattcgaata cctgacatgt tctgttacga atccacggca tcggtatatt caatggacat 3900 
ggccttccag tatttgcttc tcttaaagaa agttccatgg ttccattgta atggttaact 3960 
gacggcttcc ttgtacacat tggccatcac aggtgatttg agcaagacag tcgaggagat 4020 
caacattgag attcggtatc aagcactgat accacacaga actggcgggt atgaagcagc 4080 
ccgtcttggt agtgtggcag agaaatcgtg gcgtgtagca cctacgcttc agagcaacta 4140 
tcgcgccctc agacccgcag accaactgga gtaccggact cggcgttttg gtcggcgtta 4200 
tcaaccgggg ttattgtacc gccaacccgt gcagctacct agctcactgt ggagaatgga 4260 
taatgctagg cttcggtctg acgatgtcat tct at tgcug atgatgttcc tggcacccct 4320 
ttatctacca tcctggcgag gtcgaatcag cccgcgtccg tcatactcca ggtctgcgca 4380 
gtaaggaagg cacatccatc aaacatcaaa ctagaagaaa ttggccacga tgacaccatt 4440 
agatgcgccc ggtgcgcctg ctcccatagc tatggttggc atgggctgca gatttggcgg 4500 
aggcgcaaca gatccccaga aactgtggaa attgctggag gaaggaggga gcgcctggtc 4560 
taagattcct ccttcacgat tcaatgtcgg cggggtctac caccccaatg gccagcgggt 4620 
aggatcggtg agtatgaagg attctgggtt gagcattttt gaggcccata tcttcctgtt 4680 
cagaacgata ggcgttgact gcgagtagat gcacgttcgc ggtggacact ttctcgacga 4740 
agacccggct cttttcgatg cctcattttt caatatgagt actgaagttg ccagtgtacg 4800 
tcccgcgatc gttgtccagt tgtgtatgga tcagaagcgg aataaaccca tgctaagact 4860 
gccgaatagt gtatggaccc ccagtaccga ctcatacttg aagtcgttta tgaggcgctc 4920 
gaagctggta tgtattatat tccttggttt cccacgtggg tattaactcc ccacggctcc 4980 
gcagcgggaa ttcctctcga acaggtctcc ggctccaaga ctggggtttt tgcaggaacc 5040 
atgtatcacg actaccaagg ctccttccag cgccaaccag aagcccttcc acggtatttc 5100 
ataacaggaa atgctggcac catgctcgcg aatcgcgtct cccactttta tgaccttcgt 5160 
gggcccagtg tctcgatcga cactgcctgt tccacaacct taacagcctt gcatcttgcc 5220 
attcagagct tgcgagctgg agaatctgat atggcgattg tcgctggcgc gaacctgcta 5280 
cttaatcctg acgtctttac taccatgtcc aaccttgggt gagtctggtg ttcaatccat 5340 
ctagtgatca gcattcttgt tgcacagaca atatgtgatg ttaactgtga tgtgctgcga 5400 
ccagcttcct ttcgtccgat gggatttcct actcatttga ctcgagagcg gatggctatg 5460 
gtcgcggaga aggagtggct gcgatcgtct tgaagactct gcccgatgcg gtgcgagacg 5520 
gagacccgat ccgcctcata gtgcgcgaaa cggcaatcaa ccaagacggc cggaccccag 5580 
ccatcagcac gccgagcggc gaggcccagg agtgcctgat ccaagattgc tatcagaagg 5640 
cccagttgga cccaaaacag acttcgtacg ttgaggccca tgggacggga accagagcag 5700 
gagatccgct ggagcttgca gtcatctcgg ccgcgtttcc gggacagcag atacaggtgg 5760 
gctccgtgaa agccaatatc gggcatacag aggctgtcag tggtctggcg agtttgataa 5820 
aggtggctct ggctgttgaa aagggggtta tcccgcctaa tgcaaggttc ctccagccga 5880 
gcaagaagtt gctcaaggac actcatatcc aggtagcatt atcttcacga ttttttcctc 5940 
tcattctatt ctttctattc cagctcctcg ctgatttaca aacagattcc actgtgtagc 6000 
caatcatgga taccaaccga tggtgtccgt cgcgcatcaa taaacaactt cggtttcgga 6060 
ggcgcaaatg ctcatgcaat cgtggagcaa tatggcccgt ttgcagaaac atcgatctgc 6120 
ccacctaatg gttattctgg caactatgat ggcaatttag gaacggatca agcgcatata 6180 
tatgtgctga gtgccaagga tgagaacagt tgcatgagaa tggtttcaag gctgtgcgac 6240 
tatgctaccc acgccagacc agccgacgat ttgcaattgc tcgcgaatat agcatacacg 6300 
cttggttctc gtcgctcgaa cttccgatgg aaggcagtat gtacggcaca cagcctcacg 6360 
ggtcttgccc agaatttggc gggagaaggc atgcggccaa gcaagtcagc cgaccaagta 6420 
agactgggat gggtgttcac aggccaggga gcgcaatggt ttgcaatggg tcgtgagttg 6480 
attgagatgt atcctgtctt taaagaggcc ctgctggaat gcgatggata tatcaaggaa 6540 
atggggtcaa cctggtccat tataggtaaa gacccgcaac aagtccccgg cccaggctat 6600 
ggaaagcact cactcatgtc accattgcag aggaactcag tcgccctgaa acggaaagtc 6660 
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gcgttgatca ggcaga a zlc agtctgccat 
gtctgctctg gtcgtggaac atccaaccag 
cagccgcagc gcacgccaic ggggcaccaa 
tacgcggtgc attgacacca agagaccgcc 
ctgccggatt gagccgcact gaagcgggta 
aagaatgctc ggnggtgcgg :gtatcaaca 
tgtccgccat tgccaagitg gaggaactgc 
tgaaagtcac ccaagcctct caczccagcc 
ccggtcttac agaactcttc ggagcagacc 
tgatctacgc ttctcccaga accggggccc 
ctatacactg ggtcgaatgc atgcttcacc 
tgtgcctgga cgaaaacgac cacatgccaa 
acggagcgct tggaggcccg atcaagcaga 
acatccctta tctgtcccgt ctttctcgtg 
tcgcatcaga acttatccgg gccggatttc 
gcggatgtga agcagctcgg gtccaagtct 
acgagaccag atactggaaa gagccgcgca 
cagtccacga cctgatcgga ttgcaggagc 
acaatgtgct tcgtgtgtca gatttgccat 
ttgttttccc tggggctggg ttcgtgtgta 
cgtccgacca tgaatctgac gacatcagtt 
ccctgattct acctgcggac ggggaagaag 
ccgatcagag tctgggttca caggactggc 
acaagaatga ctggacggaa cactgtacgg 
cctccagttt gtcgaaccaa caacggatag 
cgcaggagct gtgggactca ctacatcggg 
acattacgtg catcgaaagc gacgggcgag 
cggcctccgc aatgccacac gcctacgaat 
actctgcagt tcaggcagcc tataccactc 
cgatggtccc cgctcgcgtc ggctgcatga 
ccagggacat gctgcgcgca caagcgaaga 
ccgatgtagc agtttttgat gaggcagatc 
ggctggtctt tcagtctctg ggggcaagtc 
ccgggaatac ttgcagctcc tggcattggg 
ggcttgaaaa aaccctgggc acaggtattc 
ttcgacggtg ttcggtgcac ttcattcaag 
tcgagaggct gagtggtcat ctggccaaat 
gtgcccaaaa tggcgagctg gggccagaga 
caagatgcag cctccgctct agagtggttg 
gcctgggctc cgtgctcccc gctatcctac 
tggatggcca cctgttgtcc cgctactatg 
cgcaagccag cgagctcgtg cgcctctgct 
aaatcggcgg aggcaccggg ggttgcaccc 
cgccggtagg ccgctatgac tttactgacg 
agcggttcgc gggatggcag aatgtgatgg 
cagaagcgca ggggtttgtg tgcggatcct 
atgccacttc taacatgcag cgcacattga 
gcaaactcat tcttgtcgaa accaccagag 
ttctgcccgg ccggtggctc agcgaagaac 
ctacgatgtg gcgcagcatg ctgcacacta 
gtgactgcga tagccacgag ttctatatga 
cgactccgat gtcatgctcg gtcaaattgc 
ctacgcccat gtcttggata tcagatttgc 
tcacttcgct acaggcactt cgtcaagttc 
ttggagaggt ggaacactcc atgcttggtt 
cctcaatgct acagctggct gggggaactt 
ctgatgatcc cctgaaggct ctacacctcg 
atggcaagcg atttgtctca cttgacctcg 
cgcgcgatgc cattgtcagt gttctggatt 
actatgcaga gcgggatgga gttatccatg 
gaggcgagga agacgggtat gccttggagc 
tagatataca gactcctggg ctcctcgatt 
catatgaacc agataaatta ccggacgact 
ttaacttccg tgacatcatg gtcgcgatgg 
aatgcgccgg cgtggttaca agtctcagcg 
cggtcggaga tcgggtttgc gccctcatga 



tgtctacgcc tc'::caaatt gcgcttgttc 6720 
tagccgtcac cacicactcc agcgaagaac 67S0 
caccccccz c cgccactgga ataagctata 68 4 C 
tggcgtcggt acaiaagggg ggcatgttgg 6900 
tatacatcac acagcrtcca ctacagagtg 6960 
gcccgtcgag tgtcacggtc tcgggagatt 7020 
tccatgcuga tcgtatattt gcgagacggc 7080 
acatgaactc gatgacagat gctttccgag 7140 
ccagtgatgc agcaaacgcc agtaaagatg 7200 
gcctgcacga catcaatcg: cctcgggatc 7260 
cggttgagtt cgaatcagca ttccgtcgaa 7320 
aggtcgatag ggtcattgag attggacctc 7380 
tcatgcagci tccagagctt gccacgtgtg 7440 
ggaagagctc tctcagcacc cttcgcctnc 7500 
ccgtcgactt gaatgcgatc aactmtcccc 7560 
tgtctgatct acccccctac ccttggaacc 7620 
tcagccaatc tgcccggcag cggaagggcc 7680 
cgttgaacct gccgntggcg cggtcatggc 7740 
ggctacgcga ccacgtcgtc ggctcgcata 7800 
tggcagtgat gggaatcagc acgctctgct 7860 
acatcctacg cgacgtgaac tttgcgcagg 7920 
gaatagatct gcgccccacg atttgtgctc 7980 
aaagattctt agttcattcg atcactgctg 8040 
gacttgttcg agcagagatg gaccagcctc 8100 
acccacggcc atggagccgt aaaacggcgc 8160 
tgggaattcg tcatgggccc ttttttcgaa 8220 
ggtcatggtg tacatttgcc atcgcggaca 8280 
cccagcacat tgttcaccca accacactag 8340 
ttccattcgc tgggagccgg atcaaatctg 8400 
agatttcctc ccgacttgca gatttggagg 8460 
tgcacagcca aagtccttcc gcattggtaa 8520 
cggttggagg gcctgttatg gagctcgaag 8580 
tgggcacttc tgac'cgggac tccaccgacc 8640 
ctccagacat cagcttagtt aaccccggct 8700 
aggagcacga gatcagcctc atattggagc 8760 
aggccatgga aagtttgagc gtaggcgatg 8820 
tctatgcgtg gatgcagaaa caactggcgt 8880 
gctccagctg gactcgggat agcgagcagg 8940 
ctggtagcac caacggcgaa atgatctgtc 9000 
gtcgggaagt tgatccgttg gaggtgatga 9060 
tcgatgccct caagtggagt cggtccaacg 9120 
gccacaaaaa cccgcgcgct cgcatactgg 9180 
agctggtcgt ggactccttg ggcccaaatc 9240 
tctcggccgg gttttttgaa gcagcccgca 9300 
attttcggaa gttggacatc gaggacgatc 9360 
acgacgtggt gttggcttgt caggtcctgc 9420 
ctaatgtgcg caagctgttg aagccaggag 9480 
acgagcttga cttgtttttc actttcgggc 9540 
cagaaagaca gtcgactccg tcactaagcc 9600 
ctggattcaa tggtgtggaa gttgaggctc 9660 
ttagcaccat gatgtccacg gccgtacagg 9720 
ctgaagtgct cttggtctat gttgactcat 9780 
agggagagat tcgcggcagg aattgttccg 9840 
ctcctaccga gggccaaata tgcgtattcc 9900 
cagtcaccaa cgacgacttc acacttttga 9960 
tatgggtcac ccaaggagcg acaatgaagt 10020 
gattactacg taccatgcgt aatgaaagcc 10080 
acccttcgcg taacccatgg acaggcgatt 10140 
taattagcat gtcagatgaa aaggagtttg 10200 
ttcctcgggc atttagtgac tccatcaatg 10260 
cattccagga cagccagcat ctcctgcgac 10320 
ccctgcactt cacaaagcgc aatgtggaca 10380 
gggtagagat tgaaccgagg gcgtttggtc 10440 
gtcaattgga atcaaacgtc atgggcttcg 10500 
agacagcaag aacaattgca cccgggcttg 10560 
acggacactg ggcgtcgagg gtgaccacaa 10620 
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ccccga cca a cctactgcoc a t :ccaqaga ctctcag::: cccgcatgc: gcctccatcc 1068G 
ctccggcct: cacaacacc: tacatttcac c c tacaccc: tgcccgcatt ctgccaggtg I 0 7 < C 
aaacggtc:: gaiiccacccc ggcgcagcac acgiccacca gacggcca: l attcttgcic 1080C 
aattaaccgc tgctgaagtc tttacaactg ctggcagtga gaccaagcgt aaccttttga 10860 
tcgacaaa:; ccacctcgac cctgatcatg tcttctcgac cagggactcc agcttcgtcg 10920 
acggtatcaa gacccgcacc cgtggcaagg gggtggacgt ggttttgaac tcgctagctg 10980 
ggcctctcc: tcagaagagc tttgactgtc tggctaggtt tggtcggttt gtagaaatcg 11040 
gcaagaagga tcttgagcag aatagccgac tcgacatgtc gacgttcgtc cgcaatgtct 11100 
ccttcccc:c cgttgatatt ctctactggc agcaagcgaa gcccgctgaa atcttccagg 11160 
cgacgcccga ggtcatcttg ctgtgggagc gaacggcaat cggcctgau catccaatat 11220 
cagagtatcc tatgtcggcc ctggagaagg cctttcgcac tatgcagagc ggccagcacg 11280 
ttgggaagat tgttgtgaca gtagcccccg atgacgeggt cctcgttcgc caggaacgaa 11340 
tgccactat. cctgaagcct aacgtgtcgt azcztqzzqc tgggggcctg ggtggtatcg 11400 
gacggcggat ccgcgagtgg ctggtcgatc gcggggcgcg atatctcatc at tctgtctc 11460 
gaactgctcg cgtggacccg gtcgtgacga gtctccaaga gcggggctgc accgtttctg 11520 
tacaggcgtg tgatgtggcc gatgaaagcc agcttgaagc ggctctccaa cagtgtcggg 11580 
cggaggaaat gcctccgatt cggggcgtca tccaaggggc aatggttctc aaggacgccc 11640 
tcgtctcgca aatgacggcg gacgggttcc atgccgccct gcggcccaag gttcagggaa 11700 
gttggaatcn gcaccgaatt gcatcggacg tggatttctt cgtgatgctc tcatccttgg 11760 
tgggtgtcat gggaggcgca ggacaagcca "actacgcggc tgccggagcg tttcaggacg 11820 
cgctcgcaga gcaccgcatg gctcacaacc agccagcggt caccatcgac ctcggaatgg 11880 
tccagtcaat tgggtatgta gcagagacag attctgctgt ggcggaacga ccccaacgga 11940 
tcggctatca acccttgcac gaagaggagg ttctggacgt cctcgagcaa gctatatctc 12000 
ctgtgtgttc ccctgccgca cccacacggc ctgctgtcat cgtcaccggc atcaacaccc 12060 
gcccaggccc tcactgggca cacgccgact ggatgcaaga ggctcgcttt gcggggatca 12120 
agtatcgtga tccgttgagg gacaatcatg gagctttgtc gctgaccccg gcggaagatg 12180 
acaatcttca cgccaggctg aaccgtgcaa tcagccaaca ggagtcaatc gccgtgatca 12240 
tggaggcgat gagctgcaag ctcatctcaa tgttcggcct gacggatagc gaaatgtccg 12300 
ccactcagac attggcgggg atcggcgtgg actccctggt cgccattgag ctccggaact 12360 
ggatcacagc taagttcaat gttgatatct cagttttcga gttgatggag ggccgaacga 12420 
tcgccaaagt cgcggaagtg gtgctgcaga gatacaaagc ttagatatat atgtatatgc 12480 
atatctctcc ctatagctac atatatattg acatgcctcg atagtgtctg attttctctc 12540 
ttacagcatc ccgttctgag atgcagattt gtttcttgcc tagcgtgaat acgtcacttg 12600 
ttgtgtgaca atgaataaat cagagccata gccatgcaag cgtaatccta tagagtcctt 12660 
ggatgagacc aaggccatgt atcagcgcag cacatcttgc tgtctctttt attcattaat 12720 
gctcgtccgt agaccggaaa atggttttat tatcttaggt ttccctatca ataagatcct 12780 
tagctgggtc cgcagggtta agaggccact gccacaatcc aggttaatag tctcaacatc 12840 
gtggtgctgc aacgtctgat tgaggatctg tgtaacatct acggatggta aggaaaattg 12900 
gttgaagaat gttgcatagc cctttgagat gtatgtagag ttgtttgggg caggttcggg 12960 
tttacgctct aggatctgct gcagattacg ggttgaaacc ccccagacgt gctgggttta 13020 
gttagtgtag gttaattagg ataggtctat gaagctactt actacttact aaggtttgta 13080 
gtcttgaggg tagctgggaa tccatgtcgt cgggccatac tttgggaaac actttagaga 13140 
cacggtgcaa cccaccatat cgcccgcacc cgccacgctg cagccagagg gaatagaaag 13200 
aaaacagaag agaaacgcag ataaagaaca acggcgttgg attcctgcaa caagccttgt 13260 
cccacgctac ctgcagcgcc tcccaatttt cacatgggtc cggcggaacg aacgtgagac 13320 
aagatgctac cgccgagtgc gcagccggaa gggatttggg acacggaggc gtgtctaacc 13380 
ctatattggt taggtgtctt aagtgatgtt tatcgtcggt tcgcacagag ttacacgaat 13440 
atcccctttt cctgatatga tgtatattta tatgtaaggt gccatctcca catacatcca 13500 
tccatccatg gatacacccg tcttcactct gtaccttatg gcttcagcag acaagaatcc 13560 
ggaggagaca tcgcttaaga attcctgcgt cgccatacct cccgtcaata tcaccaccgc 13620 
caccccaact gccctctccg cctacgcccc ctcgcgccaa ccctcggagc ccaccctcat 13680 
cgaagaagag aaactcctga cccacaaaca caccttcgaa atcttatcaa gatgggtcta 13740 
ccgccacatc gtcacctcgc caccgctcat cacatcccca catctggccg atgtcgccgt 13800 
acgagctctc caacgcgccg agccgggttt ttctctccac gacaagggag atactacata 13860 
tacttctgac tcggaggatt cgactccata taccgacacc gacacagacg aggagttact 13920 
ctactcgtat gcggcaattt gtccggcgtt ttcggcggga gatgcagagt atcaacgtca 13980 
ccaccggatg ggcgtatatc ccatgaattg ggtacgtatt accgacatat atccttgcaa 14040 
cgcaactcta ctgctaataa gttagcagtg" ctatt ttcga ttgattgtgg gttgcaaagc 14100 
cagacacata ccagaacccc aaggcactct ctaaacgggc ctatctagta gtctctcata 14160 
tcgaattatg tcctcctcca taataagacc aaattcgctt tggctagtca tgtcaatgcc 14220 
aagtggaaag tcggagaagt tgtcaatgcc aagtgaaaag tcggggaaac caaggcattg 14280 
gccagcatca ttgctcagca aatcctgggt tgcatcctca gaaggagtcg caattgtacc 14340 
atccattggt atttctctcg ggttgaagta tatgtcccag acctggccgt ctccgggtac 14400 
accaagcggg ctgatattca agttatatga agagccggtg ctagttgtgc tagtggacgc 14460 
attggcgaca tcgtaatcca tgtaggttgg ctcgttccat gagggcgttg gacttcgtga 14520 
tatcgttgtc ggaataaaac gcctctccct taaattctgg aggaggaggg caatatcgct 14580 
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acagcgactg gccacttggc t caan t cgta cgagatagai: gcagatgacc tattcgatgc 14 640 
cgaacggtcc gtczazazqc ccaagaaac: oagt6tc:cc atgtcctgtg aatggatcgg 14700 
ctggtct ncc tccacaatgt cgacaiaga: icccatgaa: gagatcaaag aatacttcag 14760 
gacaatgcta tgaagtacat ccccaggtta ggatgctgtc aagagggtcg cacggaagac 14820 
ttacctgtcg aaattgctga aaccagattt acacccancg cacaattgct tgaacagcat 14860 
cagacaatcc cgagcgtgtt gtaggcagac agcgcccctt ccccggctcc aaatgcgttt 14940 
atgcagaagt acccaacagc tcgagagtgc aaacctcaac tgcgttgcac agatatactc 150C0 
ttcgagaggc ctattcggat cccccgcacg cagtacaegc tcatgttgca cgtgccactg 1506C 
gctaagtttt cgagcaattc tgcggatacg gcggcggacg cgactctgtt: ccatcztgcu 15120 
ggattttgct gagtataagc ccagatatat ctcctcaaga atgaatgcca accg; a t gcg 15160 
tgccgcaaaa gcatttggta gtggcgcagc ggagtcatat ggaggcaatg gcaccctgca 15240 
gtcatacgag ggcaataggc aggcctngcc cccaacgatg gatacgtgct eccaaatcct 15300 
atagtaagcg aaacgctcag aaaacgaagc aagttttcga cgaacatcga cgaggaagat 15360 
gctccagaac aagtctttag gctggtcgcc cacagcccca tcttccgttg gcggggtcgt 15420 
tgtcgttaaa tggagtccaa tgagcctgga caactcgcag acttgagcaa agatagtcag 15480 
aaaaattgcg aaatcaaaat actccattgc cacaagtgcc ttagtataat aaagtcaggc 15540 
cacgttccaa aggtagggtt gtaagacgca ccaaacaaaa gagggcccgt atattcgata 15600 
gccggggttt aacaagagtt tccaatcgat tatagcactg cctgatatta cgcagcaggc 15660 
ttatgatcat ataatctatt ggtatgtctt gtcctgtgca tcccacgact ttgttcgcaa 15720 
ttgatgtttg agtcagggcc tgaagcacaa tgcagttgaa agaggtaatc caaaccaggt 15780 
ccggaggccc agttctgatt gtgtactgag attccactgc gtccataata gtctgtcgac 15840 
taaatatagg gagtcgtgga tgcagcttct tgaaatattc gttgatagac ggctccacta 15900 
gagcgcgggg tggaagagac gggagaaagg tatcgttttc ggatgccgat agtgccggga 15960 
tatcctcaat gatctgttga agcgtttcat tcgcctcagc caccttcgag cgcacagctg 16020 
acacagcttg gcggtcaaat gcgaccatcc ctgcgacttg gccgctggtc atatcgaccg 16080 
gtaaccaaga ggttaggttg gcgttagcag cctgtgcctc catgacgagt gaaacaaaag 16140 
agccataacc gtggagaggc cagaatccat cttcctcggt gggtgaagtg tgtttgcctg 16200 
agtacaagtt tacttcacac tcttctgagc tggttgagct tgtgtacccg ctttcr tgtg 16260 
actgtggcct tgatagatca cgaaagccct gatgggcatc aataaaggac aggcatcgtt 16320 
ggagggcatg ttcgatatct ccaacccgac tcccgagctt ctcgacgtcg cgcgccctga 16380 
ctgagagtca gaaagacagg tatttgaccc aaatatagac catcttatca cttacaattt 16440 
tgctgtttgg ttgacccgtg taactttgtt ggagaattca cagggtaggt tatatcggag 16500 
gcaatgggag cactttggac cctccttgct gcatcggacc ttacggatcc ggcactcatc 16560 
acatgcagga aatgcctgcc gcttcgacga cagctctgca ctctccatca ttcaactcta 16620 
tgtgagatag aacatgagtg aacgatgagg gaaggccctg gtactcagat aaqcecttga 16680 
tgtctctgta gatctcaact gtgcaagacg aagactggtt tgaagtaagg cccacagctg 16740 
aagtggcatg ataagttcac ctgttggttt aatatacctt attgttgarg aaagcctgac 16800 
cttggccaaa gaagggagca gctgagtatc cgacatattt gataaggacc agcggcaagc 16860 
ttgggatctg ctgcatactc ggattacgcg aaaaggtagt cggagtacgt gaaaaggtac 16920 
tcggagtacg cgaaaaggtt agattaagcg ctacggacgc cggcagtccc tgaatgaaac 16980 
tctgcggggc ccacttgccg ctgacggtgg cacgatgaac cttctggccc ctactctaat 17040 
atagcacaag gtgctcgaat aacaaagctt gtcgccatcg tggcgttgac gccagcatgg 17100 
ctctgcattt gctgtactat acggtaagct tagtagggcc aaatttatac tatagtatgt 17160 
tatttccgat ttacgtacgg aatgggaaag aacttgtatg cggccgcagg ctgaaaaggg 17220 
ccgaattaaa gaactgccat aactatgtgc atcccgatga tgcctcaagc gcagcattca 17280 
acaggggcaa aatatcgtca caggtcagat ttattactac atcgggaagt cttcggagat 17340 
tgattccgag gcatagagtg ccttggacta gcgacgccaa gtttctacta tcaggaaaag 174CC 
tgctagttcc agattggaac tagccaataa gtagccagcg gtaagctgaa caactaattt 17460 
ttaaatatta gatgaacaaa tgcgtagttc agagacacca tgcacgttgt ccaatactat 1752C 
gaggctttag caagacaaag tggagcacac gccacctggt aaaagtgaac accgcctgct 17580 
gcaaaagctc gtggagtttt attttgatta ctagaaagca gacatcccct ccttgtcccg 17640 
gaatattgca aatcagggaa cattgaataa gctgggccgc taatgcgtgc taagaacagg 17700 
gagtccacca gagtgaggta gatcccttcg tatttcagct gtggcaggac agcgatctgc 17760 
ggagctccga cggctagccg tatggcattg acagtgaagg agatcggaac ctctactcca 17820 
gagtacattc cgaaaagcca taattattcg gatgtacgga atacggaggg atgataagcg 17880 
ctagaaatag ggtggatcct actaaaattc ggatccttta gcttgtgcta acgtgtcaag 17940 
ctggggcatc ctgtagtccg tgctactcct ataggtttac cggctatata aatctggtgg 18000 
agccagcggc ccctgcacct ctttaggaca cgcggcccct tcaaaaaagt caaaatagca 18060 
gtcaagagca gtgaaagaga cccatgctgt ctcatagaaa agcaaggaac ctcgaaatcc 18120 
acatgagatc gatccacatg gttctacttc aagtgggtca ttaattcccg tcacctgaga 18180 
catgagatac ttagtagcta agatatggta catgctgttg atcaaacgat atgagtgtca 18240 
tgttggggtg aatactaagg aacatttcta ccaacaatct cgcggtgaat agaaaactga 18300 
cctttaaatt ccagcacggc gaacgaacag catccatatt tcagtttggc tgcgtcaaga 18360 
acttgtagac tttctcgtac cttgacgctc tgtcagtgta caagaatatc atgcctgtag 18420 
attactccta aacttacacg ggaaacatga tggccgaggt aagggaaagt ctccctgtcc 18480 
gaaaccagac gccggaccag aaaaccccaa tgccttcatt gcgcagaagt gtttccacgc 18540 
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agttaaaggt atttccgtac agttatcttc cccgaaaaqe t tgcatcctg gtaacccgaa 18600 
gtcgcatcca agtcagcata cacctatctg ggcgggttac tgtcgatctg agcagcactt 18660 
accgcgtctt gaccacgccc agtggctgtg tegaccaggc gcaacaaacr ccagtaacag 18720 
aaccqaccaa ggtgcttgcc agagggtgca cgtcttcgcc gttcttcgag tattttcggg 18780 
ccagcccaat aagttcgtta taaacagtga acctcactgc cgcattggac gactgccgca 18840 
aaattgtagg accaaccgca gagaagaatc caagcggtcc ccgatctcga agcattccag 18900 
ctatcgcgcc gaaagtcgta cctaactctg catttccaac cttccttgca tcaa:gctaa 18960 
gagaacgaat taagaactga tcggataacg ggtgagatgt tgcaacttac attitcgtct 19020 
tgatcgcctc cgctcgggti acggctaaga cagcctcggt cacgccagcc ccaaacccag 19060 
ccaggacgga agctccagtc gagagctctc catttgggcc cgagagggcc gagcgataaa 19140 
tattgaagga ggcaaattct gtttgttatt gcaaacggtc attccttttc gctccacgcg 19200 
ggcttataag ggggatcgct acacacgaac ggaggctttc aatgtggttc ccaccaaggt 19260 
ggctccatac ccagcatacc agcctcggat tccaggtttt atagctgcca catcatggtt 19320 
tctccgctta agctgggcgc gagttttagc cgctaccagc ggaagcaata gactgtcagt 19380 
tggcggccac ggtaaagcgg aactcttaaa cacaggataa gctcacattc gaaagggtag 19440 
gtgatggaga tttcaactgc cccagcacaa gcacccgcga ccaatgcagg aatgcctttc 19500 
gtctatacat atattaagag aagacattgt cagtaacatg gcacacgcgc gaccaagaag 19560 
gaacacatac acgcttccca cgggcttttt gggtaagggg tgcctttggt aatggaacat 19620 
ttgtctggac tttagactcc atgacgagga tgtgctgagt ccaaacaaag cttttcttca 19680 
cagagatagg gctgcagacg ti-tatttccag ttcaccttcc ctgtgttcag tatcaggtct 19740 
tatattgtat tatctcaatg cttatgactc taagtggaat acattggata tcagtctgtc 19800 
acggagtcgg cacccgatgg ctatcgcaat cgtccctggt gggtcttgaa atcgtatgtc 19860 
acacttattc cggatgaaac acattccgga gcgcgcgttg atattgctaa acagtacaga 19920 
cccaaatggt ctgcagaagg ccctaaatag taggtctcat tagccagtat ttagttgtga 19980 
ttgcagatca ttgtcagcct aacatcagtg taggttacgg tgtgatattt acttgcatag 20040 
aaggttccag accacacggt tctagatcct ttgacagcag catgaatgga ttcccctcta 20100 
ggtgccgggc gccgacgtgt gcgttgctcc gaaatttgta ggacggagct cggataccta 20160 
gccgctatgg gcatcggagg ttgtagcagc gtacacgctt ggatagttaa ataatcggat 20220 
gtacacccac tgttggaaat gacgggggcc taaaacacga gattatctga tccaatttct 20280 
gttcgttggc attctatcat tcgcagcgaa gatcgtcctc ttaaattgac catgaccaag 20340 
caatctgcgg acagcaacgc aaagtcagga gttacggccg aaatatgcca ttgggcatcc 20400 
aacctggcca ctgacgacat ccctccggac gtattagaaa gagcaaaata ccttattctc 20460 
gatggtattg catgtgcctg ggttggtgca agagtgcctt ggtcagagaa gtatgtgcag 20520 
gcaacaatga gctttgagcc gccaggggcc tgcagggtga ttggatatgg acaagttagt 20580 
tctatccaat ctgaacagtc tacaaagtat actgacgatc ctttgtatag aaactggggc 20640 
ctgttgcagc agccatgacc aattctgctt tcatacaggc tacggagctt gacgactacc 20700 
acagcgaagc ccccctacac tctgcaagca ttgtcctccc tgcggtcttt gcagcaagtg 20760 
aggtcttagc cgagcagggc aaaacaattt ctggtatagc tgtcattcta gccgccattg 20820 
tggggtttga atctggcccg cggatcggca aagcaatcta cggatcggac ctcttgaaca 20880 
acggctggca ttgtggagcc gtgtatggtg ctccagccgg tgcgctggcc acaggaaagc 20940 
tccttggtct gactccagac tccatggaag atgctctcgg aatcgcgtgc acgcaagcct 21000 
gtggcttaat gtcggcgcaa tacggaggca tggtcaagcg cgtgcaacat ggattcgcag 21060 
cgcgtaatgg tcttcttggg ggactgttgg cccatggtgg gtacgaggcc atgaagggtg- 21120 
tcctggagag atcttacggc ggtttcctca aaatgttcac caagggcaat ggcagagagc 21180 
ctccctacaa agaggaggaa gtggtggccg gtcccggttc attctggcat acctttacta 21240 
ttcgcatcaa gctctatgcc tgctgcggac ttgtccatgg tccagtcgaa gctatcgaaa 21300 
accttcagag gaggtacccc gagctcttga atagagccaa cctcagcaac attcgccacg 21360 
ttcatgtaca gctttcaaca gcctcgaaca gtcactgtgg atggatacca gaggagagac 21420 
ccatcagttc aatcgcaggg cagatgagtg tcgcatacat cctcgccgtc cagttggtcg 21480 
accagcaatg tcttctggcc cagttttccg agtttgatga caacttggag aggccagaag 21540 
tgtgggatct ggccaggaag gttactccat ctcatagcga agagtttgat caagacggca 21600 
actgtctcag tgcgggtcgc gtgaggattg agttcaacga tggctcttct gttacggaaa 21660 
ctgtcgagaa gcctcttgga gtcaaagagc ccatgccaaa cgaacggatt cttcacaaat 21720 
accgaaccct tgctggtagc gtgacggacg aaacccgggt gaaagagatt gaggatcttg 21780 
tcctcagcct ggacaggctc accgacatta gcccattgct ggagctgctt aattgtcccg 21840 
taaaatcgcc actggtataa atgggagcga tttcatgcca cgggcacaaa tcctagggca 21900 
tatcgtacct gtatgatgga agcaccagcg gtttagcaga tagatgatag gttccttctg 21960 
ctctgcgttg cgttttgaat ttagttactt cgccggctta agaatttaga atgaaatgca 22020 
gtctctctta ttccttatta aactcacgta ctcccacatt cggcgactgg aggatacgaa 22080 
agcagtgttg gtgatgtttc ctgtaatgga tatcattttg ctgactgaat tattctatga 22140 
cctttccctc caacggcgtt cttatctcga cactttagat gttgacgctg ccttgaggaa 22200 
ctagctttgc gctgcgaagg ctatgagcag tggagctgca tcctttcgcc tagatatcca 22260 
ttctgcatag atccaaggca gggcttcgta agaaaagttc acgttcactg taagtccatg 22320 
caagcggaac ggccgcttaa acaagtctat acagtaaagc ctgcctataa gcaaccgccc 22380 
atataaggaa tcccgcgata ttagcatcta aaatccgcgt ctgaattgat tttctatata 22440 
aataagcagt aaactgcttg aaaaagccct gcrctcctat acaaagctac cttaattaga 22500 
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aaatataggt tgactagcta aaaargtgcc zzacaatazc qtattactat ataatactta 22560 
cacgaccacc ggaggtaggc tagaaat aza tatcgtaaac egacnacccc ttagtaaaaa 22620 
tatata: r ::g tatagacctg get gcaaaca anttcctat: ataagcaact ttttggtgag 22680 
ctgaattcgt tgettatage caggtttget gcaattgata aaaggtgcca attcatcata 22740 
atctatcccg categgatga attgttgacg atccacacca taaactgeat tatgttctac 22800 
attttcccca ttggtatcta ttgggtaggc aggttgaaag ccctrtctgc ccacctttgc 22860 
atatttatcc cccaccqzqc gacggccgcc gtgatcaatc cagctatagt cgaagcaacc 2292C 
acaccigcaa caactgegag catgtacact gagaggtaag aagaaaggta ccacacacag 22960 
cgtgaacgtc catccaacrc egagggegtc tatgaeggge acaacgagcg cactactccc 23040 
tgcagaaaag gagtatcgaa tcacatactt tcctgcaacg actgcagacc ggtcccgtgg 23100 
caaggcttct acttgeggtc agagacaaca ttattgtggg catgtacaca aaggggataa 23160 
attaatgagg gtgtggaaca aaccagccac gtaagtgttc aggcagttaa aactgcccat 23220 
gagcccccag cccgcgaaga acgccgcgat tatgggcact accatcccac ccttatcctc 23280 
ttggagtgtc cacccgtaaa tgagcgttcc cgcaggcagc aeggcaaaca atgtgatgag 23340 
cccgctgtgg agtcgatcct gagggagacg gaatccgcgc tttactatgt atctccgaac 23400 
ggtgcgatcc gaaagtttac cgccgacgag actccctatc aggaaccegg cacctggagc 23460 
gaggtagaag agacccgata ctagggcagt cgttaaatga aaccgtgagt tgaatatagc 23520 
acgagctgaa gtcaggatcg aatattgcgt aatcgecagg aggccacagc ataagtcctt 23580 
ttgtttgcgg catgagcatt tcgtttccag tagatgeaga gggcatatct cccaggcact 23640 
tacggcaaga aagacatttg gatacaccca ctgettgage acatccgttg gggagaattt 23700 
cgatatgatt gaaacaagtg tggtcggttt aaacgccgtt gagaccttct cagaagttcc 23760 
ttcgattttc gggaaaaata gcagggaaag cacgagcccc agtccgctca tacctagttg 23820 
aagccagaag ataacacgcc aactcgtgaa agtgacgatg acccctccca cgcaggggcc 23880 
tgatcaaaca gtcagttcct gttgggagtt ctagtacttt cagcagggta cgtacctatt 23940 
geagggecag aaagagtccc ggccatgaag aaacctaegg ccgtcccacg gtaaacctgc 24000 
geacgaaegg attagttttc gcaattgggg agacaagggc gtgtgatgcg tacaggctca 24060 
aagatatctg caagaacagt ttggcctgag accatgaacg aggttccggt taagccgctc 24120 
aatactctga aegctatgaa cattttctcg tttatcgccg ctgccgttcc ageggagcac 24180 
gcacaaagca ttgaaatggc cagattgtat gatgtccgcc tgccgactaa cttgttcatg 24240 
ggaccccata tgagggatga atatcccatg gcaaccaaga caccagcatt ggagatattg 24300 
atagtctcga cagtcatatc aaattcattc gcgatttcag gggeggcagg aagaagacaa 24360 
gtactggaga aagtaacgac tagagtcatc caactaacaa caaaegtaat gacacatttt 24420 
ctccataatg gaatatcacg gccctttttc tctggctcgt tttgtccgct accttccgag 24480 
gtcgtcgctg ggttcgggga ctcagtgtca ccgcggccca tttctgcaat ggatggcctt 24540 
ctcgtcactg ageggactet tcaaaagaaa atatagaaca tggtcaccca aaacatggat 24600 
gttcctgatt gaeggtttat gtatactcta tcatcggcct gccacacatt tcgccgagac 24660 
tacgttgata gtgtattgga gcgccgctag accttcggcc ttgctggccg agaegggtae 24720 
ctaataaggt agattgtgtc tttccgcaaa ggtattcatg tggatcagtc tagaatggct 24780 
gcaaggccca ctgacttgac taatgeaage caacgccagg atagttatat aaaatccaat 24840 
gagggatcaa categggate gaeateggga ccagcacccg gcatccacaa ggaceggate 24900 
cgaaaagtac ggccggccga aaggaaaccc atggaaggtg catagacctt cactgtacaa 24960 
tacagtactc tgtggacaat gatattggee accatctcac ccgtccgata atccatgtca 25020 
ttgatgactg tatttcaaaa tagattggat atctgagaag tgatgaagtc tatagtatgc 25080 
tctcttctgt atgagggatt ttgttggttc aacagggttt atcattagtt tgtcactatc 25140 
tggccttcgc cgtacaggaa tgcaccgcat gtgtttgtac tctctgtgac tccctcgaga 25200 
ctaaggacat aeggtaegta tattggctaa ccaccattat taaagaagcg gtggccgagt 25260 
gagtacctta ataaggcttg tatgccctct tagtagttgt eggataattt ctccaataat 25320 
agggtggata caggggtaac ggeggaaegg aaagaactcc gccgtcggcc tgcggcgcgc 25380 
ggcccacctg cacaaccagg ctgaattctc ttgaattctc tttttgatac eggaaggegt 25440 
gagaaggagg ggaaattcat catcttaagt gctcatctta tatctgettg aegaatgeag 25500 
attgeccagt gcgctcaccg gagcccgagc aaaggggcta gagegggtet gggttacata 25560 
acgacagtgc gacaccctct tcatgtaaag atctcgcctt attcactttc cacattttga 25620 
acatccgcaa ctgtgccgta cccagtctcg tgaagccggt aaaatggcta aattcgcagt 25680 
aaaatgeetc gatcacttgg tcctgactgt geggtctatt ccaagaacga ctgcatttt.a 25740 
taccaagcat cttggtatgc gacatgagac gttcacctcg ccacttaatc gaactatcca 25800 
gaggtattcg cgcggaagtt gtgatctaga ttgtcgcgat teggcttact gaacagtcaa 25860 
actaggcatg cgcrcatctt tggttctcag aagatcaacc tccatgagca tactaaggaa 25920 
tttgaaccaa aagcacgtaa tgtgcaaccg gggagtgcag atctgtgttt cttaacagat 25980 
acagaegtet gecaagtget caaggcattt agggacgccc agattgaggt atgtgtagca 26040 
aatgettcta aaagcgcaag cactgaaggt taggtgeagg tactagaaga ttccaaagtt 26100 
gtggatagaa caggggcaca aggcaagatc cggagtgttt atgtgcggga ccctgatgga 26160 
aatctagttg agtaagtgtt cttgttattt aacagatttc ccttgcttct aacatgtgtt 26220 
aagagtatca aactatgtct catctagcca ggctggcggt caataagctt tacaaggtta 26280 
tatgacccgt aactgagttg cgcctgtacc gtgattagag acaacattca ttgttccatg 26340 
ttggcctcgt tctgtcatca gttggttaca cactggctta taggcaacat ttcgegtaga 26400 
tttaggtagt tatttgetec gcagttccgc ttgtaatcaa agaaaacegg gttctggaga .26460 
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atctgtttca tttcaggcct ccgcctggac agigacocct gaggttctga tgggatatgc 26520 
tgggtttact czccacczza aacatagtcc acctctcaac ggcgtattgg ataatatcga 26580 
aggggatctc agttgctagr ccaacztaca cczaaaqzaq tcttgcaggc catataatat 26640 
tcgacataac tataccactc cagtgatgga aatccataac ttatgatact tccgaatgaa 26700 
cgtgtgtctt tcgtgtacat aagtccactt cataaaatcc aatatacctc aataaaagct 26760 
Lcgtaaatca tcctngcatc agcaccctcc ccccgccccc tttgtttccc acgccttacc 26820 
ggcatactca cgtaacctac aactcctttg tccgctcctc actctctgca atccaccctc 26880 
tcaaactggt ctcatccctc ggcttaaaac caaccttgaa tggcttcggt tccgttgtca 26940 
tcgatgtcgg agtagcgttg cactcaatag caccaagcac tggcatatcg gccgggtttt 27000 
gagctggctc tatggtaaac gccacgataa accgaagaaa gaccgtgtat agctcgcggc 27060 
tcgcgaggtg ggaggctgcg cacatgcgcg tcccggcgcc gaagctgtag tgaggggtgc 27120 
cgaagccttc gctcggctcg aggtatcgct ccgggagaaa gcggttgggc atgtcgaaat 27180 
gatcttcgtc gtagtttgct gcccacgcat tctgacagat caattagtcc aggaatgacg 27240 
ggataggacg ccgggggtta ggggttaggg ggattgattt taccatgaaa aaggtcgttc 27300 
cagcagggat cctcgcgcca ttatagatga ccuccttgat attcacacgc ggaatgcaga 27360 
taggcattac ggtccagaag cgcagggtct ctttcacaag tgcagtgata tagggcactt 27420 
tttcctccac tagacaccgc tcccaggcgt cgccattcgg atacacggac ataatttctt 27480 
cgtaagcctt ctgctgaatg cgttggccgt cttccgaaga caggtacgcg atgcccatga 27540 
tcaagttgcc aggaacggta tcaagtcccg cagaaaccat ggtcagacag atagacttga 27600 
tttccgctga gaaaggttag tatatataat cacaaatcat atagtggtgc atcgctcgtg 27660 
aaatagacat accatctgta agcttcgtct cgggattctt cagaatattc cccgtaatgc 27720 
acggcttatc tgtccccttg gccatgcgat ccttcaaaat atcaaacaaa aaggccatgt 27780 
acttatctct ccgcgcacgg agatgcttcg cctgattgct tctgttcgag aacaacctga 27840 
gcagaggaac gtaatcctgc cagttgttgc tggttgaccg gagattggcc acgccgcgct 27900 
gcacctcgca gatctctctg agaagctggt cgttcacatt gccctcgatg cggtagccat 27960 
agttcagagt taaactggtg ttgagcgcaa accgctggaa gtagggagta gggttgatat 28020 
ctatcttccc gccctgtgaa tctttcagca actccttaat actggccata ctctccagat 28080 
cgattatggg catataggac tgcaccgcga cacggttgag cgctgtcgca gccgccttac 28140 
ggcgccgttt acatgactca tcccacggtg acgttccgat cgtaaagccc tgtgagctag 28200 
agacaacacc atgaaatgtg tgaaatgtag ggcgcgatat catcgaagac tgctccttaa 28260 
tccacagctg gcgggtggac tcgaaggtgt tggcgaagat gacgcgccta ttgcccagac 28320 
gcgcctggaa cacaggcccg aattcctttg accatttgcg agcgactgtg gcgtgtttga 28380 
cacccagttg tatcagatta ccgaagatgg gcacgcctgg gatctcgggg attcccttaa 28440 
tcttagggat gtcggttcgg ttgaagtagc gggtgaggaa gtagataacc gcggtagcgg 28500 
cgatgacgat gatttggaga gtcatagtgg cgagatgcga atggtattga agaatagata 28560 
gataattctt ttactcaggt tggcatggat tgtggcccgg gctttatact tcaaccctct 28620 
atcgacatca ttccttaaag accaggatat tccgtcatag taacggcgat agtgacgcgg 28680 
ccgcgttctg tcagcccgcg atcagctctg tcatgtggcc aatattctga tctacattgg 28740 
tttcagactg atggtctggc tcgaatcgaa gcttcaaagg gctctcacaa cgctgcgttt 28800 
ccgattatcg gtcctatctt ggtgcctacg gacacggcgg ccggcgacga tcgagcggac 28860 
ggccggcacc cggcgatgga tctgatctgc cgcacatctt aatgataggt attaatggac 28920 
ggctgatcta accttataag ctataacact tatggagagt cgaatgcaga gttgaaggag 28980 
acgttggaag acaaacactg tagtatggcc gcctctaact agttcactga caagggtgct 29040 
gtaacaggca tagactgtta ggacggatca acccgaccta agctgaccca ataccaagtc 29100 
tacggggtac tttctgtaaa gaggctggag accgcgacgt tcaattattc caatctgttt 29160 
ccaccactat cttatatgta taagttgtct tccctcggtt aacttgctct tcatgttaca 29220 
tcttgtcact aataggcatt tgatgtttga tttggctatt gactattgac agaacctcct 29280 
atgaatttcg cctttcagtt gccgtcggtg gttgctgtcg ggccgactac ttgttgccgc 29340 
tggtgattga atgctactta tatatatcca tgtctttttg tcggctttta tcacgccact 29400 
gccgccggat ttatccggta gaccctccta tcctgtcccc aaagcggggg aatgcgtcaa 29460 
gatcttcggg gatgaaattt ccccgcatcc agctccctac ctatactaca cagttaccaa 29520 
caattggcaa taaatagaca aacataacga acaaaaggca gctagaccgg atattaccgg 29580 
gtaaacgggt ttgtaaattg gacatcctgg ctctctccag agctacttaa ggattgtctg 29640 
ttggagaagc caaatgagaa agaggtgctt gtattagctt ccgaagcgcg tttctcgata 29700 
cctacctagt caatgcgcct ttcaaattgg ggctacatgc ttcttgaggg ttttatcagg 29760 
accaacttcg atctgtccgt tgcaagaacc aagaaacctt tccgccaaca aagcaaaact 29820 
tgaccatgaa gcccgcaatc cttatgaaat actggctctt cgtctcagct gtgagcgcgt 29880 
caaccctgaa cggcaagctc acattgagtg agacaaaggt gacgggggcc gttcagctgg 29940 
cttgtaccaa tagtccaccg gacatctata tcgaccccga tgattcggtc tcagtggttc 30000 
gcgcagccca cgatctggcc ctggactttg ggcgcgtctt tggtaaaaat gccacagttc 30060 
gcttcactaa cgagactcat ccaacatcga tggccatcat cgctggtacc atagataagt 30120 
caaccttcct tcagaggttg atagcggatc ataagctcga cgttaccagc atccgtggcc 30180 
agtgggaatc ctattcatca gcactggtgt tgggtccagc caaaggcata cagaatgcgc 30240 
tagtcatagc tggcagtgac cgtcgtgggg ccatctatgg cttatacgat atatctgaac 30300 
aaattggcgt ctcgccattg ttctggtgga cggatgttac cccaaccaaa cttgatgcca 30360 
tctacgcgct agatgttcag aaagtccagg gtccaccgtc agtgaagtat cgtggaattt 30420 
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t cat caacga 
agaacgggga 
gcccgaaagg 
ccaacaatgg 
t gacggttgg 
gagcaacaaa 
tcaa tgttaa 
acacaa tggg 
aagaaagca t 
tgtcgaacgt 
tatcgtttgc 
a tgactgtac 
cgtctcccac 
gacatgaacg 
tgggatcaac 
ggggata t 



cgaaccgccc 
ccccgcct tc 
gaattacctc 
cccactagcg 
gactccct gc 
cgagcaa tec 
agcat t tatg 
ca tgcgggg t 
tgt tagctgg 
gg t tcaacca 
tgactgccag 
cagaccaggt 
t gcagaatga 
cgccgccgcg 
tgcgcgcggc 



gecttgeata 
at ct cacgt t 
tggceggega 
gactactacg 
t tgaaagece 
cagtttctaa 
agggagggtg 
etaggega t g 
caggaa tccg 
t ttgtcctat 
gtctgtgaca 
cacat tgata 
aactgggcgt 
ctgttacaag 
a tacagccac 



actgga ttct 
tctacgccca 
tg tggtcaaa 
gagtggtaa t 
at gctgacta 
aegggaegtg 
taa 1 1 aggag 
ctgea tcccc 
tgctatcgga 
t tga tgt tag 
caggaactgg 
tatcctgatg 
tctgggggcg 
tggatcaaca 
ggtgct caga 



tgcaaat tat 
tqtcc tcgag 
tatcctttat 
gggcactagc 
cgaaaaagaa 
ggactgga 1 1 
ccaacactgg 
cacacttaac 
ca tcctgaa t 
gatcca t tea 
gaacttacta 
acaatgeagg 
caggaatt ta 
cagctcaact 
cagta tgggt 



ggcgaggt tg 
ctga z cc tgc 
g t tea t gaca 
cacact ggta 
ccaa ccgctc 
agcaa cgagg 
gagaccgea t 
gcaacagtgg 
aaaaccaacc 
ccct caaata 
tgagagegge 
caata tgctg 
ctatcatttt 
gatcaggacc 
tgccaa tatt 



304 c :■ 
3054 : 

3oec : 

3G6cI 
3 07 2" 
3 07 E I 
308 4 3 
309C : 
30963 
31022 

3ioe: 

31142 
312C2 
31263 
31322 
31325 



<210> 20 
<211> 5053 
<212> DNA 

<213> Aspergillus terreus 
<400> 20 

atggcgtctc tacttttctt tacggtgttc aatctaacac tggctcttct atcatctact 60 
gecacaggag cagccgtccc tgtctcgega cccacagacg attcgagata tatagacttt 120 
gaegctgetg aatggcgtcc aagagcaaaa cgagatgatg ccctgaaagt ccctctacgg 180 
atcctccctc ttggcgcatc catcacctgg ggatacctat cctcaaccgg aaatggatat 240 
cgcaaacctc tccgtgacaa acttegcttt gaaggctggg aggtggacat ggtgggcagt 300 
aagtccaacg gtgacatggt agacaatgta tgcatctctc ttcccccacc cctccgacag 360 
atagacagac caattgacat ataaacgegg gaaaaggatg tagaagecca cagcggcgac 420 
gtgataaege aagtgcaaac cgcggccgca aactcgctcg cctacaagcc gaacgtcgtg 480 
ctgatcaacg ccggcaccaa cgactgcgac tacaaegteg accctgcgaa cgccggcgag 540 
cgcatgcgct ccctgatcga aaccctaatc ggcgccccgg acatggccaa cacgctcatc 600 
gtcctgtcga ccctgatccc ctcgggttcc acaaccctcg aagctaacag gccctccgtc 660 
aacgegcagt tccgcgagct ggtccttgac atgegegagg cgcagaatgt ctccatcgtc 720 
ctggccgata tggatccgcc ggctcccagc cccggaaaca actggatcac gtaccccgat 780 
aacttcgccg ataacaagca ccccaacgac tacgggtact cccagatggc agacatctgg 840 
tataacgega tctacaacgc tgcggtggcg gagctcattg tcaagccggc ggaccttgac 900 
atctcatcca eggggacctg tgacaaagag tacgggagcg gagtctaege tggcgggttc 960 
aegcagcaag ggagtggtga ggatgacgga atctatcgac aegacagega gtatagcggg 102C 
gcgttgttta ctgtccgcgc egggaagggt gcagccgatc catacaagga tgacgacgag 108C 
ctgcactttt tettegggag gctttatact agggegtatg atgacatgat gatctrccac 114C 
aaagataagg actccggcgc ggtgacgttt gtttcttaca cgaataatgt ccacactgag 12CC 
gagcaggagt ttacgaaggg ggggacgttc tcgactcata ataattgtaa cccggggggt 126C 
gtgeatttta tcgacatcaa eggtaagcac tgtgtctgtc tgccgaggaa ccatctggga 132C 
ctgatttatg tgtgataaat gataggegae ggacttgatg actacatctg catcgccttg 138C 
gaegggacca cctacgcaag catcaacaat ggagacggcg aegecaagag caacaagcct 144C 
ccatccttca ccgatategg actatggaag agtcccgaag gatacgatca ggcacatgta 150C 
cgccttgctg atatcgaegg cgacggccgc gccgactact gcggtttggc tgacaaegge 1560 
gaegtcacat gctggcgaaa tggatggatc gaagatatcc ccgcatactg gcagccgctg 1620 
ggcaageget teaeggggaa agtcatggga gacctgcgcg gcgtgcgatt cgaggatatc 1680 
aacggcgacg ggegegaega ctggatgtgg gttgatgacg atggcgctac gacaacatac 174C 
accaactccc ggagctgeat caaaggagag tctggtgacg ggttgaacgt cgtgtggcgc 180C 
caggggttct accaagatgc taactctggc ccgtcgcatc ccggaatggg agtaatattc 186C 
gggacatccg gattaeggga tcaggtctac tttgegegae tetatggega ggtggcggat 192C 
tttggagagc tegggagaca ggactatgtg ttcatcaaga aggatacctc tgacaagtat 198C 
tttgggccgc tgtattacgt tcatgtgtgg aagagcaagg gegcaggagg ggctaagatc 204C 
aaaggtatgg aaggaagttc tcatagagag gttgattget aattgttata gecgaeggag 210G 
acaggtattg caatatgatg ggccacgaca atggtatgat ggactacatt tggatccatt 216C 
caaccggcca tatgegtett tatccgaata ggggcctggt tgaagtcccc gecgaegggt 2220 
cgagcttctg gggggcgaat gagattatct tcgaccccca agagcagatt ggcatgaagc 228C 
ttgaccggcg egatctgeat ctcgcagact gggaeggega cggagcctgc gatataatct 234C 
ggaeggatec cqacaatctg aacagggccc aagtttggcg gaacaagatc aaagacaegg 240C 
ggagttttga ctgggactac aatatcaatg ctgcagatga getttactge cccgagcacc 2460 
gaggecttgg tttctttgac cggccggtcc attttgetga tgtttctggc aacggcaagg 2520 
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ccgattatct gtgcgttgag aaggacgacc 
gatgggacta cattgatcaa mcaagtact 
gcGCcgacct caacggcgac ggaaacgccg 
atgggtcggt gtggcacaac cttggccaac 
gaccgcaggg tcccaagtac cgaggggcgg 
tgaacggcga cggtcgtgca gacangcaca 
agacgtggta caacgaatgt gccaccaaag 
accccaatct acctgtatct cctgraaaag 
acaacagcga gtgcactagg gcccaggtgc 
ttcatgctgc ctcggaagtt gcgtacttta 
tcttctttgc cgaatcactc accgacagct 
tcgaccggat ggtcaccatg atttccgggt 
tcacttgcaa aaaccttcgg ggctgtgacg 
ggaatcggct taatttctgc ccaaagttct 
tgctcgcgag gtgcgactca attaatcttc 
ttttgcacga agtaacgcat acggactatg 
ggtatcacat attttttctc attcccaggc 
cgcgattatg tctatggatg gaaaggagct 
cactgtatcg aaagaggcag aaaggctgaa 
gatgctaact ggcaacgcag attgctttgc 
atctgtgaca gcaagttgtc cgcctacaat 
gggtactata ccaagatatg tggtcgacag 
gcggatgact ccagctgtcc ggcctacgat 
ggcgtcaacg attacgttca cttcggtgac 
acaaccggtg acagttgccg cgtgggaagt 
tttgatactg aggatttcac ttataccaac 
ctgaataaaa agatcgacca gtggctagga 
ctgacaattg gagggaacga tgtgttcttc 
atgtggtggt actcgcttga gcaataccgc 
cgcaacctga tgcaggatac agggtctgac 
gaaaagatcc tggatagatc tggctctagc 
tgtcgtgctg tccttcgtcg cgctgacttt 
ctatgtcacc ttcttcaacg aagacaccac 
aagcccacac tacgacccgc agcaatccgg 
caaggaactc aacgacctcg tccgcatgct 
catcaacacc gcccggaata cggagcagat 
cggccaccgc tggtgcgagc ccggaaccca 
cttcttccta tccgcatggc ccgatatcgc 
gaacgcgacc gagacagacg aaattaccgc 
cgatgcggat acgtgccagg atgcgctggg 
gtgtgacgtt gcggtccacg tcaaggcgaa 
ccgagcgaat caggccattg ccaataggga 
gcctagtccc tag 



gcacctgggc cngggtcaai: ggggacgacc 2580 
ccgaggagaa ggacagggcg aatcuacac: 2640 
atatgatctg gacagacaag ttctcgggec 2700 
gtgatatcaa gggatcgcga tacgaatggg 2760 
ttgaaggctc atgcacttat ttccctgatc 2820 
gcatctggaa ctccataaac aacacagcgc 2880 
accacacagg cgatgacggc ccgataacta 2940 
cccccatcga gctcacccct cattatcagg 3000 
agacgctctt tgaagaaatg caatatgccc 3060 
gcggcggcgc atacgaccca tatagggaca 3120 
tgaccttcac tacaaatgta aggtatacgc 3180 
cttcgcaatt cgacgacgaa aagttcacga 3240 
agaacggctg gttggccatg atgaacaata 3300 
tcacagatga gtcgaagagt tccaggtcag 3360 
atgatgccca tctcactcga gccggggcga 3420 
ttatggagat tgtcaatgga gagaatgggt 3480 
cgatactgat tcttfcacct taggaggacc 3540 
cgagatcttg ccgcagggac cttcaatcga 3600 
agagccgcta atgagctccg tatagccggc 3660 
ccagacccaa ataacctcgg gcaagaaggc 3720 
gcggattcat gggctcttgt cgtacttggc 3780 
attccccttc ctgaggagtc tgcttcttcg 3840 
gattcgtctt atgatgctga cactgtgtac 3900 
tcctacgccg ctgggatggg tacaggaacc 3960 
aacagctacg gaaagctcgt ccaggagtgg 4020 
tatgcgtgct ctggagatac aacggttggg 4080 
caggacccca cggggactac catggcaacc 4140 
agcgatctgg tttccaactg cgtgctaaca 4200 
cagtggtgtc tggagactga agagaaagcc 4260 
ggactcggct cgaaacttag ggctgcgtat 4320 
gtatatctcc ctgttatcct tatttattcc 4380 
actttagtcg ttcaacctct acgtccctgg 4440 
cgactgcgac tcaaccacct tctggtacga 4500 
caactatgtg tggctcacga ccgacctacg 4560 
caactcgtta atccaatcca ccatttccga 4620 
ccattacatc gatatggacg cgcgatttga 4680 
agaaccagac cccgacaacc caaacactta 4740 
gattgttgga gacacgacag ccgagagcac 4800 
gcttatgaac tccggatcga tccagctgcc 4860 
atctgacccg gatccctatg cggttttcat 4920 
ctcgtcgagt ttgatcgcgc agagcttgga 4980 
ctatagtagc caggatgtct cttggtggtt 5040 

5053 



<210> 21 
<211> 987 
<212> DNA 

<213> Aspergillus terreus 
<400> 21 

atgactctac caacacttcc taactggata aggatgtgcg tgcatttgtc ccttacacat 60 

ctccatcagc accgttcccc gaaatacgag tctataccta ttaaaagtat ccaggctaat 120 

tcacacagaa tcctcatcat cctaaccaca gcctccttct acccgcagat ccggtgcatc 180 

caacttcgaa actccacgca cggcatctcc actgcctaca tcctcttcaa cctaatcagc 240 

gcaacagaac acttcaccat cctattcgca ttgctggtaa acagcggcgg agatgtcctc 300 

atccatgagc cccccacgac cggcgacggg ttgaacctgt accagctttt cgcagtgtgg 360 

atgggatgct tagtcctctt ctgccaagca atccatagcc tccacgccaa tccacgccgc 420 

aaactcatcc tactaaccat atacattcaa tacctatgca tttctatctt accagaggtc 480 

atcgacgcaa tcaccactcc cgaggaaacg agaaaacaaa ggccgccaac gggcgagagg 540 

aactggctga tcggactctt tctttccgcg cacgcgatga ccgtcctgcc actatcggcc 600 

gtgctccgca tcgcgggatt catagatcag tcgcgactga tctcgcggcg cagacgggag 660 

cagccatcgg tcttaagcct gacaggcctg gcgtgtcagg ccgtggtctt tgctctagtt 720 

tctggactct gggtactcag ggttcagcag cctgttcctc gaatgccgat gagaagacct 780 

gtggattgga tgtattggta ccatgtaatt gggtggccgg ttgtcgacga tgcggtttat 840 

gcgctgggac aatgggtttt gttttggtat gcggtttgtt ggcgttctcg gggcgatgct 900 
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agggatgaag cagtccatgc tcgggagact gatgacctct taggagagga tgaagggcat 960 
gggtacggcg gaaccgggac ttcttag 987 



<210> 22 
<211> 972 
<212> DNA 

<213> Aspergillus terreus 
<400> 22 

atggtcggga gcaagttagc ccataatgag gagtggcttg acatcgccaa gcaccacgcg 60 
gtgacgatgg caattcaagc gcgccagctg cgcctctggc ccgtcattct gcgccccctt 120 
gtacattggc tcgagcccca gggagccaaa ctccgggcgc aggttcgacg agcccggcaa 180 
cttctcgatc ccattatcca ggagcgacgt gcggaaagag atgcctgccg ggcaaagggc 240 
attgagccgc ctcgctacgt agactcgatc cagtggttcg aggatactgc caaggggaaa 300 
tggtacgatg cagccggggc gcaactggcc atggactttg ctggtatcta cggaacctcc 360 
gacctgctga tcggtgggtt ggtggacatc gtccgacatc cccatctcct tgagcccctc 420 
cgtgatgaga tccggacggt catcggccaa gggggttgga cacctgcctc gctgtacaag 480 
ctcaaactgc tggatagttg tctcaaggag tcacagcgcg tcaagcccgt cgaatgtgcc 540 
accatgcgca gctatgcatt gcaggatgtg actttctcca atggaacctt tatcccaaaa 600 
ggagagctgg tggcggtagc tgccgaccgc atgagcaacc ccgaggtctg gccagagccg 660 
gcaaaatacg atccttaccg gtatatgcgc ctgcgagagg acccggctaa agcgttcagt 720 
gcccaactgg agaacaccaa cggggaccac atcggcttcg gttggcatcc acgggcttgc 780 
cccggccggt tctttgcctc taaggagatc aagatgatgt tagcctactt gctcatacga 840 
tacgactgga aggtggtccc cgacgaaccg ttgcagtact accgccattc tttcagcgtg 900 
cgcattcatc ccaccacgaa gctcatgatg cgccggcgcg acgaggatat ccgccttcct 960 
ggttcactat ag 972 



<210> 23 
<211> 771 
<212> DNA 

<213> Aspergillus terreus 
<400> 23 

atgcgttacc aagcatctcc agcgctggtg aaggcgcctc gagcgcttct ttgcatccat 60 

ggggctggct gctctcccgc catcttccgc gtgcaattgt ctaagctccg ggctgcgctg 120 

cgcgaaaact ttgaattcgt ctacgtgaca gctccgttcc cttcctctgc agggcctggg 180 

attctccccg tcttcgccga cctagggcca tattactcct ggtttgaaag cagcagcgac 240 

aacaatcata atggaccctc cgtgagcgaa cgcctcgccg ccgtccacga ccccatccgc 300 

cgcaccattg tcgactggca gactcaacac ccccacatcc ctatcgtggg tgctatcggt 360 

ttctccgaag gtgccctggt gacgaccttg ctcctctggc agcagcagat gggtcacctg 420 

ccctggttgc cccggatgag tgttgcgctg ttgatctgtc cctggtatca agacgaggca 480 

agccagtata tgaggaacga agtgatgaag aaccatgacg acgacaacga cagcaaagat 540 

accgagtggc aggaggaact ggtcattcgg ataccgacat tacatctgca gggtcgcgat 600 

gattttgcgc tcgcaggatc gaagatgctg gtggcgcgcc atttctcccc ccgagaggcg 660 

caggtattgg agtttgctgg gcagcatcag tttcccaatc gaccgcgcga cgtgttggag 720 

gtcattaatc gttttcgtaa gctgtgtgtg acggcccaga cattggagta g 771 



<210> 24 
<211> 1253 
<212> DNA 

<213> Aspergillus terreus 
<400> 24 

atgggcgacc agccattcat tccaccaccg 
gatgaagtca ccgtctggaa tgccgcaccc 
gtccgcgtcg aggccgtggc gatcaatccc 
acgccctggg cgtttctcgg aacggactat 
gtgactcata tccaagtggg tgaccgggtc 
accccggatc agggggcatt ctcgcagtac 
atccccaagg gcttgtcgtt cgagcaggct 
ggattggcga tgaagttgct tgggctgcct 
acccactcca agccggtgta tgtgttggtc 
actatgcaaa tgctccgcct gtaatgcttc 



cagcaaacag cgctgacggt aaatgaccat 60 
tgccccatgc tgccccgcga ccaggtatac 120 
agtgacacga agatgcgcgg acagtttgcc 180 
gccggcacgg tcgtcgcagt gggttcggac 240 
tacggggcac agaacgagat gtgcccacgc 300 
acggtcacgc gaggccgtgt ttgggccaag 360 
gccgcgctac ctgcgggcat cagtaccgct 420 
ttgccatcgc cttcggcaga ccagccaccc 480 
tatgggggca gtacggccac tgccactgtc 540 
ccttgtcctg agacttttct ctccgttggt 600 
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CGLcgccic: 
z tgcaacs z c 
t tgacta teg 
acccctccca 
acaccaaqaa 
tctgcttcgc 
aacacgccgc 
gcgagggat c 
teggegagga 
ateegt tgee 
ggaagggaga 



acaaccqazc 
c zccccccac 
ggccccgaa t 
ctccactttg 
caa zczccgc 
ageca z egge 
cacgcgcaag 
aacct ggece 
tctgtggcgc 
eg tcct gcac 
getgeegggg 



C Z tcTSC lcc 

aaz z tcQccc 
ctcgcgcaga 
gect tacaga 
tatcctctcg 
cgcgcggcgg 
acggccacga 
gcccccta tg 
a tegegggge 
ggegget teg 
gagaaactcg 



Ca t CCGCtCC 

l ggccaae n c 
cga zeezcaq 
acagaccatc 
actgt a tcac 
ggcactacgt 
ccgactggac 
ggcgt ccegg 
agetegtega 
atcacat taa 
tggt tegget 



cagctccgga 
qcccggccca 
tgaacccctg 
gata t tcttc 
caacg tcgag 
ctccctgaac 
cctggggccg 
cagtgaggaa 
aga tggaege 
gcaaggca tg 
egaggggecg 



talc" tCCaa 

gaggaggtct 
ccaccgccct 
tagegtaect 
tccaccaca t 
ccgt tccctg 
accatctttg 
gageggcagt 
ctcgtcca tc 
gagctcgtcc 
taa 



66C 

720 

78C 

840 

900 

960 

1020 

1080 

1140 

1200 

1253 



<210> 25 
<211> 1394 
<212> DNA 

<213> Aspergillus terreus 
<400> 25 

atgggatcca teattgatge tgetgeggea 
ttcegcaagg ccgtgaaatc caggcagatc 
agtggtgaga gaccccaatc ggaccccttt 
acagegggae atacctaggc aatctaaatt 
gaegggaega gtgcaatcag ctgccgccgc 
gtgcgaccaa gctgctgacc acgatcatgg 
acttggatga gacggtggat aggctgette 
ggtttgacga cgegggaaac geaagattge 
acctgctgac gcatacatcg ggactgtcgt 
acatggccca gggccacctc cagteggcag 
cgccggccgt caacgaccct ggggcggagt 
gtaagctcgt cgagcgggcc accggcctcg 
gtgcgccgct gggcatcacc gacatgacct 
cgcgccgggc cgaccaaacg caccgcaact 
eggtctaett ccgggccgat ggagaggagt 
ctgggtccta tatgaaggtg cttcactcgc 
cacagaccgt ggacttgatg tttcagcctg 
accagcacat ggacgccagc ccacatatca 
gtegcagett tgggctgggg gggatcatcg 
gecgaaaagg ttccttgacc tttgggggtg 
ctagattcct ttggtttatg tcaccttcaa 
attagcaaat cgaccccaaa gccggcctgt 
ggaatgaccc ggtctgtcgt gatctgacac 
accagcaggg ttaa 



geggatcegg ttgttctgat ggaaaccgcc 60 
cccggggcgg tcatcatggc ccgagattgc 120 
gcgacaatta caagcacacc gagacgaatg 180 
atacgegctg ettegggget cggacggtgc 240 
tacaggtcga caccccctgc cggctcgcca 300 
ccctacaatg catggagege ggtctcgtgg 360 
eggatttgag cgcgatgccc gtgeeggagg 420 
gagagegteg ggggaagatc acgctgcggc 480 
acgtcttcct ccatccgttg etcegggaat 540 
aaaagtttgg catccagagt cgcctggcgc 600 
ggatctaegg cgccaacctg gaetgggegg 660 
acctggagca gtacctgcag gagaatatct 720 
ttaagctgea gcaacggccg gatatgettg 780 
eggeggatgg gcgcctgcgc tacgacgact 840 
getteggegg ccagggggtg ttctcgggcc 900 
tgttgaagcg agaegggetc ctgctgcagc 960 
ccctcgagcc gcgactcgaa gagcagatga 1020 
actacggtgg gccgatgccc atggtccttc 1080 
ccttggagga tctggacgga gagaactggc 1140 
gcccaaacat tgtgtgggta agctcggctc 1200 
tgttgatcca gectactaag tgctattgta 1260 
gcacccttgc gttcttccaa ctggaaccct 1320 
gcacattcga gcatgccatc tatgegcagt 1380 

1394 



<210> 26 
<211> 3504 
<212> DNA 

<213> Aspergillus terreus 
<400> 26 

atggatccgg tggttagaaa gccggaccct 
ttgcgtgcca ttgtgggaca cgcgtgtcga 
ctgaccgcgg caacgaccca tcttcatgtg 
ggtctggccc cctgggccaa ggaaaccccc 
cgcactgtta gectgggaga ggctagegea 
gtgeeggagg atggccaggt atgatagatc 
cgtgctgacc ggaccgtccc tgtgctactc 
tcaccctcga tetacegggt gcgtctgtcg 
tctcagggtt cctcggtgcg gaacagacca 
atcattccgc gttgacgttt cgagttccct 
tcgaaattat accctcggaa aaagaggatg 
aaggaagtcc caggtcactg ggacactggc 
gtgttcacca tgeggagaca gtcgacttgg 
atatgactgt ggtctctctc tttcgggtga 
cagcctcggt cctgctgtct ggtgcctttg 



ggcggggtgc agcatcgagt gaccaaagca 60 
catcccattc acactctgct agtcacggcg 120 
ctggaaggga catatcaggc tactcacaga 180 
ttgaaegtec agtcatttct ctggggaagc 240 
tggaaatggc agatagacga ccgacctaag 300 
tcctgcgctc cattggtccg agaaattctg 360 
tcaattagtc tgactttcac tgggctcttg 420 
aegecagtat ccccttccta tcaaacacgc 480 
cgcccacccc cgattcatcc ccctcacccg 540 
actcccaact agatggcttt ctacaggctg 600 
atagttggag actgaggtct cctcgcgaag 660 
teggaagetc atggctgtca ttccttcatc 720 
tgatcatagg gctcagttac etagecatga 780 
tgcgccacct cggctcacgc ttctggttgg 840 
cttttgtact cgggcttgga atcacgacta 900 
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catgcgacgt ccccatccac acccttcttc 
cagtgggcct tgagaacccg a zccaac zaa 
tgtggggcgg ggggcaccgg caactzccca 
accaattgat tcctaacatc atccaactcg 
gatcttacct cctggaaatc ggcgcgttgg 
gtcctggcca zzztzqcv.zc ctggcgccat 
ttaccttcta cgccaccatt ctttgcgtga 
gagggcttgg tcaagttaac gccaagcatc 
cgacaaacac cacctggtgg aagctactga 
tccaattgtc gcccttcttc tatcgggtca 
cccctactgc tgtcagtcct ttcaaagaag 
cggcgcgcgt cgaggggttt gagacacgcg 
tggaatcagc tgggttcaat atatcagcca 
atggattgga aagcccgctg ggtcgactat 
tcctcaacaa ccacctgacc cacgctgctc 
ccgccgtccc tgatggctcc tacttgtcgg 
gtactcgccc cccagaagaa acagaggccc 
cggacgacga gctggtggaa ctgtgtctcc 
agactctcga gcggattgcg gcgggatcat 
cgcgtgccgt gcggattcgc cgtgccgctg 
gcagcggcct ggcggagtca ttgctccctt 
gcgcctgctg tgagaacgtg gtcgggtacc 
tggtgatcga tggacaggcg ttgttcattc 
cgagcgccag tcgcggatgc aaagcgatca 
aaggtgatgg tatgacgcgt ggtccctgtc 
aagcccagcg ctgggttgag tctcctctcg 
cgaccagccg gtttgcgcgg ctccaaaccc 
acatccggtt ccgcaccacc acgggcgacg 
ttgaaaaagc cctggaggcg atggccgccg 
ccttatctgg caatttctgt tccgacaaga 
gcggcaagtc cgtcatcgcc gaagccacga 
agaccgacgt cgatgcgctg gtcgagctca 
tggcgggcag cctgggcggc ttcaacgccc 
tggccactgg tcaggatccg gcgcagaatg 
aaaagtaggt agcttctcta cgttttgatt 
gtgtgtttgc taatggtggg tttctagcat 
gccctcgatg gaggtcggca cgattggcgg 
gttggacttg ctaggtgtcc ggggcgcaca 
cttggcccgc attgtcgccg cggccgtgct 
tgcggcgggt cacttggtca atgcccatat 
caagaaatga agggatcgct gtgattgatt 
ggtacagagt acggagcaat tagaacaccg 
gacatccatt tcgtgcaaat cgaatataaa 
ttgtcatgta acttaggtag gtaa 



ttttcgaagg aatcccgcac etc:: tccga 96' 
cccctactgc tctctgcgtc tegcaagaac I C 2 I 
atggcgccag cagtgatgat acccggcaaa 1CEI 
eggttgateg agaggggtgg tatattgtgc 1 1 4 C 
cattaggggc ggtccttegg ccaaaggata 1210 
ggacactcct gattgatgee gttctacttt 12 £ Z 
aattagagat cacgcgaatc cggagcccag 1321 
etteggggat ttttgggcac aacgtcaagt 1 3 E I 
cggtgggcgg ettegttcta tgtcacttcc 1441 
tgggagaata tatggctaat ggtactctgc 15C1 
cggccaacgg actcaacgag atciacctaa 1560 
taaccgttct gccgccactg cagtaegtet 1 62 T 
etaaaegtte tacatttgac gg^gtgeteg 16EI 
gtctcatggg cgcattggtc gttagcctgg 1741 
gctggcatgc ttggccccaa gecagagagt 18CC 
tgccatgctc tgccactgcc cctgaagtct 1861 
tcctcaaatc gaaccaagca gaatctctga 1921 
ggggtaagat cgeggggtae agtttagaga 1953 
cccgctcggt gacccggctg gaggcattta 2040 
tgtcgaaaac gccctccact cagaacctct 21CC 
ategegacta taactacgag cttgtgcatg 2162 
tgcctctgcc cctgggagtg gccggaccca 2220 
ccatggccac aaccgagggc gtgctcgttg 2260 
atgctggcgg cggtgccact accatgctca 234 0 
tgcgattccc gtcggcccaa cgtgcagctg 24 C I 
ggcacgaggt tctggcggcc gccrccaacg 2461 
tgacggtggc ccaggcgggc atctatctct 252: 
cgatgggcat gaatatgatt tegaagggeg 256 I 
agggtggatt tcccgacatg cataeggtta 2640 
aatccgccgc cattaactgg atcggcggcc 2700 
tccccgcgga gactgtccga caggtcctga 2760 
acacggccaa gaacctggtc gggagtgcca 2820 
atgcctccaa cctcgtccag gcggtgtttc 2860 
tggagagcag tagttgeatt acgaccatga 294C 
ttctcctccc ggttatatat attcacgtgg 30C0 
cgatggaaac ctgcacatcg ctgtctcgat 3060 
aggcaccatt cttgaggccc agggagecat 3120 
ttccaeggag cctggcgcca atgcgcgccg 3160 
ggctggcgag ttaagtacct gcgcggctct 3240 
gcaacacaat cgcagtgcgg gtgccacagt 330 C 
cteggggcag cttcaaagga cgctatctcc 336C 
gtatatgtgg taatcttaga acatgeggga 342! 
aatacctacc tacgtagaaa agtacctacc 34EI 

35C4 



<210> 27 
<211> 1512 
<212> DNA 

<213> Aspergillus terreus 
<400> 27 

atggctgcag atcaaggtat attcacgaac 
cgcaccggtg gaacattacc ccgccgtgca 
caaaagatca aatgtactgg aaataaggag 
cagcaggctg gacttcgatg cgtctacagt 
tccagggcag eggatctegt ctctgctgac 
ccagtgccct cacagagctt gccgctagac 
eggcaattte ttgatccacc ggacagctac 
gaggctattg acactgactg ctgggggctg 
ttagagccaa cgctgccgga tctaccttcg 
ttgccaccgg tatcgagega cattgetegt 
gaectgtegg cggtgtcgca ggaactggaa 
ccgaagcagg aaatctggac ccgtgcgtcg 
atagcacagc gecgacaaaa cgtatgggca 
ctagatccca tcggaatgtt tttcaatgcg 



tcggtcactc tctcgccagt ggagggttca 60 
ttccgacgct cttgtgatcg gtgtcatgca 12C 
gttactggcc gtgctccctg tcagcgttgc 18C 
gagegatgee ccaagcgcaa gctacgccaa 24C 
ccagatccct gettgeacat gtcctcgcct 300 
gtatccgagt cgcattcctc aaatacctcc 36C 
gactggtcgt ggacctcgat tggcactgac 420 
tcccaatgtg atggaggctt cagctgtcag 480 
cccttcgagt ctacggttga aaaagctccg 540 
gcggccagtg cgcaacgaga gcttttcgat 60C 
gagatccttc tggccgtgac ggtagaatgg 66C 
ccgcattccc caactgcttc ccgtgagagg 72C 
aactggctaa cagacttgea tatgttctca 78C 
tcacgacggc ttcttactgt cctgcgccaa 84C 
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tctacggcag tacactctta catat^oa" ct^:!^ S"tacggac caaoaacctc 900 
ccctgtcgc aaattaooca arrrr--*-- gtgcgg^c-r c oacccc - r " \ LC ^ 
tcccagtcgc ccagcaoaaa lattice-^ £<?Ccata ^ ccccaccaga aaaffatcS? in™ 
atacccttct tcagcoaaes " ! ? sgcagcagcg gccacaaclg toltofcacc 0«n 
acacacgccc tattcicooc ctoc-ct ^ 9 ? C<?a 9 ct ^ "tcctatot taacccccto n.n 
aacgagatta ctctgggagr ' ^ caoo^ 9 " 9 ^"caatt gctccgtga^ 12oS 
ggggaaccag gegaggatae aaccaaafca " CSU? "gctrccat caccatgaqc llfin 

x??ss — ss-t »» 



<210> 28 
<211> 2161 
<212> DNA 

<213> Aspergillus terreus 
<400> 28 



?il III3 I£ii ssss kss s „ 
ilE SI Sii Pi ilii Si! 

cacgctctag ctgtgccctc caacccctoo 'f tccac ? cc< 3 ggttcctatt gactc?acaf 

IlIIIlIslIIIlIIII I 
lllllflli liiSllil 

aagatctttc getcaegg^g attctggtcc loot????" ^ctacgact Icgccacgtc IoS 
gtttgecaca tttctccttc tcgtcatcca tttac atcaa ccttc ccactggqac 9fin 

iiii IS IEi Sii ill | 

S SB! ass S iHII as 

Isiiiiiiipiiii 

r~ Ms S5S5S; ~ asj 



<210> 29 
<211> 8035 
<212> DNA 

<213> Aspergillus terreus 
<400> 29 



atgacaccat tagatgegee cggtgcgcct gctcccataa c-t ^ 

gutcccatag ctatggttgg catgggctgc 60 
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agatttggcc cacccocaac agatccccag aaactctcaa aattgctgca ggaaggaggg 120 
agcacctcci czaaqaztcc tcccncacaa ctcaangtcg gcggggtcca ccaccccaat 180 
ggccagcggg naggaicgct gagtatgaac gat :c:ggc: tgagcattc: roaggcccat 240 
atcttcccgc ucagaacgat aggcgttgac tgcgagtaga tgcacgttcc cggtggacac 300 
tttctcgacc aagacccggc tcttttcgat gcctcatttt tcaatatgag tactgaagtc 360 
gccagtgcac gtcccgcgat cgttgtccag ttgtgtacgg atcagaagcg gaataaaccc 420 
atgctaagac tgccgaatag tgtatggacc cccagtaccg actcatac:: gaagtcctu 480 
atgaggcgcc cgaagccggt atgtattata ttccttggtt tcccacgtcc g:at::aact:c 540 
cccatggcnc cgcagcggga attcctctcg aacaggtctc cggctcceac actggggttt 600 
ttgcaggaac catgtatcac gactaccaag gccccitcca gcgccaacca* gaagcccttc 660 
cacggtattt cataacagga aatgctggca ccatgctcgc gaatcgcgtc tcccacttt t 720 
atgaccttcg tgggcccagt gtctcgatcg acactgcctg tcccacaacc ttaacagcct 780 
tgcatcttgc cattcagagc ttgcgagctg gagaatctga tatggcgatt gtcgctggca 840 
cgaacctgct actcaatcct gacgtcttta ctaccatqtc caaccttggg tgagtctggt 900 
gttcaatcca tctagtgatc agcattcttg ttgcacagac aatatgtgat gttaactgtg 960 
atgtgctgcg accagcttcc tttcgtccga tgggatttcc tactcatttg actcgagagc 1020 
ggatggctat ggtcgcggag aaggagtggc tgcgatcgtc ttgaagactc tgcccgatgc 1080 
ggtgcgagac ggagacccga tccgcctcat agtgcgcgaa acggcaatca accaagacgg 1140 
ccggacccca gccatcagca cgccgagcgg cgaggcccag gagtgcctga tccaagattg 1200 
ctatcagaag gcccagttgg acccaaaaca gacttcgtac gttgaggccc atgggacggg 1260 
aaccagagca ggagatccgc tggagcttgc agtcatctcg gccgcgttcc cgggacagca 1320 
gatacaggtg ggctccgtga aagccaatat cgggcataca gaggctgtca gtggtctggc 1380 
gagtttgata aaggtggctc tggctgttga aaagggggtt atcccgccta atgcaaggtt 1440 
cctccagccg agcaagaagt tgctcaagga cactcatatc caggtagcac taccttcacg 1500 
attttttcct ctcattctat tctttctatt ccagctcctc gctgatttac aaacagattc 1560 
cactgtgtag ccaatcacgg ataccaaccg atggtgtccg tcgcgcatca ataaacaact 1620 
tcggtttcgg aggcgcaaat gctcatgcaa tcgtggagca atatggcccg tttgcagaaa 1680 
catcgatctg cccacctaat ggttattctg gcaactatga tggcaattta ggaacggatc 1740 
aagcgcatat atatgtgctg agtgccaagg atgagaacag ttgcatgaga atggtttcaa 1800 
ggctgtgcga ctatgctacc cacgccagac cagccgacga tttgcaattg ctcgcgaata 1860 
tagcatacac gcttggttct cgtcgctcga acttccgatg gaaggcagta tgtacggcac 1920 
acagcctcac gggtcttgcc cagaatttgg cgggagaagg catgcggcca agcaagtcag 1980 
ccgaccaagt aagactggga tgggtgttca caggccaggg agcgcaatgg tttgcaatgg 2040 
gtcgtgagtt gattgagatg tatcctgtct ttaaagaggc cctgctggaa cgcgatggat 2100 
atatcaagga aatggggtca acctggtcca ttataggtaa agacccgcaa caagtccccg 2160 
gcccaggcta tggaaagcac tcactcatgt caccattgca gaggaactca gtcgccctga 2220 
aacggaaagt cgcgttgatc aggcagaatt cagtctgcca ttgtctacgg ctcttcaaat 2280 
tgcgcttgtt cgtctgctct ggtcgtggaa catccaacca gtagccgtca ctagtcactc 2340 
cagcggagag gcagctgcag cgtacgctat cggggcacta acagcccgct cggccattgg 2400 
aataagctat atacgcggtg cattgacagc aagagaccgc ctggcgtcgg tacataaggg 2460 
gggcatgttg gctgtcggat tgagccgcag tgaagtgggt atatacatca gacaggttcc 2520 
attacagagt gaagaatgct tggtggtggg gtgtgtcaac agcccgtcga gtgtgacggt 2580 
ctcgggagat ttgtccgcca ttgccaagtt ggaggaactg ctccatgctg atcgtatatt 2640 
tgcgagacgg ctgaaagtca cccaagcctt tcactccagc cacatgaact cgatgacaga 2700 
tgctttccga gccggtctta cagaactctt cggagcagac cccagtgatg cagcaaacgc 2760 
cagtaaagat gtgatctacg cttctcccag aaccggggcc cgcctgcacg acataaatcg 2820 
tcttcgggat cctatacact gggtcgaatg catgcttcac ccggttgagt tcgaatcagc 2880 
attccgtcga atgtgcctgg acgaaaacga ccacatgcca aaggtcgata gggtcattga 2940 
gattggacct cacggagcgc ttggaggccc gatcaagcag atcatgcagc ttccagagct 3000 
tgccacgtgt gacatccctt atctgtcctg tctttctcgt gggaagagct ctctgagcac 3060 
ccttcgcctt ctcgcatcag aacttatccg ggccggattt cctgttgact tgaatgcgat 3120 
caactttccc cgcggatgtg aagcagctcg ggtccaagtg ttgtctgatc taccgcccta 3180 
cccttggaac cacgagacca gatactggaa agagccgcgc atcagccaat ctgcccggca 3240 
gcggaagggc ccagtccacg atctgatcgg attgcaggag ccgttgaacc tgccgttggc 3300 
gcggtcatgg cacaatgtgc ttcgtgtgtc agatttgcca tggctacgcg accacgtcgt 3360 
cggctcgcat attgttttcc ctggggctgg gttcgtgtgt atggcagtga tgggaatcag 3420 
cacgctctgc tcgtccgacc atgaatctga cgacatcagt tacatcctac gcgacgtgaa 3480 
ctttgcgcag gccctgattc tacctgcgga cggggaagaa ggaatagatc tgcgcctcac 3540 
gatttgtgct cccgatcaga gtctgggttc acaggactgg caaagattc: tagttcattc 3600 
gatcactgct gacaagaatg actggacgga acactgtacg ggacttgttc gagcagaaac 3660 
ggaccagcct ccctccagtt tgtcgaacca acaacggata gacccacggc catggagccg 3720 
taaaacggcg ccgcaggagc tgtgggactc actacatcgg gtgggaattc gtcatgggcc 3780 
cttttttcga aacattacgt gcatcgaaag cgacgggcga gggtcatggc gtacacttgc 3840 
catcgcggac acggcctccg caatgccaca cgcctacgaa tcccagcaca ttgttcaccc 3900 
aaccacacta gactctgcag ttcaggcagc ctataccact cttccattcg ctgggagccg 3960 
gatcaaatct gcgatggtcc ccgctcgcgt cggctgcatg aagatttcct cccgacttgc 4020 
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agatttggag cccagggaca tgctgcgccc acaaccgaao atccacagcc aaactcct tc 4080 
cgcattggta acccatgtag cagtttccga :cacgcega t ccggtcggag ggcctcttat 4140 
ggagctcgaa gccctggtct ttcagtctcn cccggcaact ctgggcactt ctgaccggga 4200 
ctccaccgac cccgggaata cttgcagccc ccggcactgg gctccagaca tcagct tagt 4260 
taaccccggc cggcccgaaa aaaccctggg cacaggtatt caggagcacg agatcagcct 4320 
catattggag cttcgacggt gttcggtgca cttcattcaa gaggccatgg aaagtttgag 4380 
cgtaggcgat qtcgagaggc tgagtggcca tctggccaaa ttctatgcgt ggatgcagaa 4440 
acaactggcg tgtgcccaaa atggcgagct ggggccagag agctccagct ggactcggga 4500 
tagcgagcag gcaagatgca gcctccgctc nagagtggtt gctggtagca ccaacggcga 4560 
aatgatctgt cgcctgggct ccgtgctccc cgctatccta cgtcgggaag ttgatccgt t 4620 
ggaggtgatg atggatggcc acctgttgtc ccgctactat gtcgatgccc tcaagtggag 4680 
tcggtccaac gcgcaagcca gcgagctcgt gcgcctctgc tgccacaaaa acccgcgcgc 4740 
tcgcatactg gaaatcggcg gaggcaccgg gggttgcacc cagctggtcg tggactcctt 4800 
gggcccaaat ccgccggtag gccgctatga ctttactgac gtctcggccg ggttttttga 4860 
agcagcccgc aagcggttcg cgggatggca gaatgtgatg gattttcgga agttggacat 4920 
cgaggacgat ccagaagcgc aggggtttgt gtgcggatcc tacgacgtgg tgttggcttg 4980 
tcaggtcctg catgccactt ctaacatgca gcgcacattg actaatgtgc gcaagctgtt 5040 
gaagccagga ggcaaactca ttcttgtcga aaccaccaga gacgagcttg acttgttttt 5100 
cactttcggg ctcctgcccg gctggtggct cagcgaagaa ccagaaagac agtcgactcc 5160 
gtcactaagc cctacgatgt ggcgcagcat gctgcacact actggattca atggtgtgga 5220 
agttgaggct cgtgactgcg atagccacga gttctatatg attagcacca tgatgtccac 5280 
ggccgtacag gcgactccga tgtcatgctc ggtcaaattg cctgaagtgc tcttggtcta 5340 
tgttgactca tctacgccca tgtcttggat atcagatttg cagggagaga ttcgcggcag 5400 
gaattgttcc gtcacttcgc tacaggcact tcgtcaagtt cctcctaccg agggccaaat 5460 
atgcgtattc cttggagagg tggaacactc catgcttggt tcagtcacca acgacgactt 5520 
cacacttttg acctcaatgc tacagctggc rgggggaact ttatgggtca cccaaggagc 5580 
gacaatgaag tctgatgatc ccctgaaggc cctacacccc ggattactac gtaccatgcg 5640 
taatgaaagc catggcaagc gatttgtctc acttgacctc gacccttcgc gtaatccatg 5700 
gacaggcgat tcgcgcgatg ccattgtcag tgttctggat ttaattagca tgtcagatga 5760 
aaaggagttt gactatgcag agcgggatgg agttatccat gttcctcggg catttagtga 5820 
ctccatcaat ggaggcgagg aagacgggta tgccttggag ccattccagg acagccagca 5880 
tctcctgcga ctagatatac agactcctgg gctcctcgat tccctgcact tcacaaagcg 5940 
caatgtggac acatatgaac cagataaatt accggacgac tgggtagaga ttgaaccgag 6000 
ggcgtttggt cttaacttcc gtgacatcat ggtcgcgatg ggtcaattgg aatcaaacgt 6060 
catgggcttc gaatgcgccg gcgtggttac aagtctcagc gagacagcaa gaacaattgc 6120 
acccgggctt gcggtcggag atcgggtttg cgccctcatg aacggacact gggcgtcgag 6180 
ggtgaccaca agccggacca acgtggtgcg cattccagag actcttagtt tcccgcatgc 6240 
tgcctccatc cctctggcct tcacaacagc ttacatttca ctttacaccg ttgcccgcat 6300 
tctgccaggt gaaacggtgt tgatccatgc cggggcagga ggcgtaggcc aggcggccat 6360 
tattcttgct caattaaccg gtgctgaagt ctttacaact gctggcagtg agaccaagcg 6420 
taaccttttg atcgataaat tccacctcga ccctgatcat gtcttctcga gcagggactc 6480 
cagcttcgtc gacggtatca agacccgcac ccgtggcaag ggggtggacg tggttttgaa 6540 
ctcgctagct gggcctctcc ttcagaagag ctttgactgt ctggctaggt ttggtcggtt 6600 
tgtagaaatc ggcaagaagg atcttgagca gaatagccga ctcgacatgt cgacgttcgt 6660 
ccgcaatgtc tccttctcct ccgttgatat tctctactgg cagcaagcga agcccgctga 6720 
aatcttccag gcgatgtccg aggtcatctt gctgtgggag cgaacggcaa tcggcctgat 6780 
tcatccaata tcagagtatc ctatgtcggc cctggagaag gcctttcgca ctatgcagag 6840 
cggccagcac gttgggaaga ttgttgtgac agtagccccc gatgacgcgg tcctcgttcg 6900 
tcaggaacga atgccactat ttctgaagcc taacgtgtcg tatcttgttg ctgggggcct 6960 
gggtggtatc ggacggcgga tctgcgagtg gctggtcgat cgcggggcgc gatatctcat 7020 
cattctgtct cgaactgctc gcgtggaccc ggtcgtgacg agtctccaag agcggggctg 7080 
caccgtttct gtacaggcgt gtgatgtggc cgatgaaagc cagcttgaag cggctctcca 7140 
acagtgtcgg gcggaggaaa tgcctccgat ccggggcgtc atccaagggg caatggttct 7200 
caaggacgcc ctcgtctcgc aaatgacggc ggacgggttc catgccgccc tgcggcccaa 7260 
ggttcaggga agttggaatc tgcaccgaat tgcatcggac gtggatttct tcgtgatgct 7320 
ctcatccttg gtgggtgtca tgggaggcgc aggacaagcc aactacgcgg ctgccggagc 7380 
gtttcaggac gcgctcgcag agcaccgcat ggctcacaac cagccagcgg tcaccatcga 7440 
cctcggaatg gtccagtcaa t tgggtatgt" agcagagaca gattctgctg tggcggaacg 7500 
actccaacgg atcggctatc aacccttgca cgaagaggag gttctggacg tcctcgagca 7560 
agctatatct cctgtgtgtt cccctgccgc acccacacgg cctgctgtca tcgtcaccgg 7620 
catcaacacc cgcccaggcc ctcactgggc acacgccgac tggatgcaag aggctcgctt 7680 
tgcggggatc aagtatcgtg atccgttgag ggacaatcat ggagctttgt cgctgacccc 7740 
ggcggaagat gacaatcttc acgccaggct gaaccgtgca atcagccaac aggagtcaat 7800 
cgccgtgatc atggaggcga tgagctgcaa gctcatctca atgttcggcc tgacgaatag 7860 
cgaaatgtcc gccactcaga cattggcggg gatcggcgtg gactccctgg tcgccattga 7920 
gctccggaac tggatcacag ctaagttcaa tgttgatatc tcagttttcg agttgatgga 7980 
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<210> 30 
<211> 888 
<212> DNA 

<2i3> Aspergillus terreus 
<400> 30 

atggcgacgc aggaattctit aagcgatgtc tcctccggae tcttgtctgc tgaagccata 60 

aggtacagag tgaagacggg ngtatccatg gatggatgga tgtatgtgga gatcgcacct 12C 

tacatataaa catacatcat atcaggaaaa ggggatattc gtgtaactct gtgcgaaccg 18C 

acgataaaca tcacttaaga cacctaacca atatagggtt agacacgcct ccgtgtccca 24C 

aatcccttcc ggctgcgcac tcggcggtag catcttgtct cacgttcgtt ccgccggacc 300 

catgtgaaaa ttgggaggcg ctgcaggtag cgtgggacaa ggcttgttgc aggaatccaa 36C 

cgccgttgtt ctttatctgc gtttctcttc tgttttcttt ctattccctc tggctgcagc 42C 

gtggcgggtg cgggcgatac ggtgggttgc accgtgtctc taaagtgttt cccaaagtat 480 

ggcccgacga catggattcc cagctaccct caagactaca aaccttagca agtagtaagt 540 

agcttcatag acctatccta attaacctac actaactaaa cccagcacgt ctggggggtt 600 

tcaacccgta atctgcagca gatcctagag cgtaaacccg aacctgcccc aaacaactct 660 

acatacatct caaagggcta tgcaacattc ttcaaccaat tttccttacc atccgtagat 720 

gttacacaga tcctcaatca gacgttgcag caccacgatg ttgagactat taacctggat 780 

tgtggcagtg gcctcttaac cctgcggacc cagctaagga tcttattgat agggaaacct 840 

aagataataa aaccattttc cggtctacgg acgagcatta atgaataa 888 



<210> 31 
<211> 2478 
<212> DNA 

<213> Aspergillus terreus 
<400> 31 

atggagagtg cagagctgtc gtcgaagcgg caggcatttc ctgcatgtga tgagegccgg 60 
atccgtaagg tccgatgcag caaggagggt ccaaagtgct cccattgcct ccgatataac 120 
ctaccctgtg aattctccaa caaagttaca cgggtcaacc aaacagcaaa attgtaagtg 180 
ataagatggt ctatattcgg gtcaaatacc tgtctttctg actctcagtc agggcgcgcg 240 
acgtcgagaa gctcgggagt cgggttggag atatcgaaca tgccctccaa cgatgcctgt 300 
cctttattga tgcccatcag ggctttcgtg atctatcaag gccacagtca caagaaagcg 360 
ggtacacaag ctcaaccagc tcagaagagt gtgaagtaaa cttgtactca ggcaaacaca 420 
cttcacccac cgaggaagat ggattctggc ctctccacgg ttatggctct tttgtttcac 480 
tcgtcatgga ggcacaggct gctaacgcca acctaa'cctc ttggttaccg gtcgatatga 540 
ccagcggcca agtcgcagag atggtcgcat ttgaccgcca agctgtgtca gctgtgcgct 600 
cgaaggtggc tgaggcgaat gaaacgcttc aacagatcat tgaggatatc ccaacactat 660 
cggcatccga aaacgatacc tttctcccgt ctcttccacc ccgcgctcta gtggagccgt 720 
ctatcaacga atatttcaag aagctgcatc cacgactccc cacattcagt cgacagacta 780 
ttatggacgc agtggaatct cagtacacaa tcagaactgg gcctccggac ctggrttgga 840 
ttacctcttt caactgcatt gtgcttcagg ccctgactca aacatcaatt gcgaacaaag 900 
tcgtgggatg cacaggacaa gacataccaa tagattatat gatcataagc ctgctgcgta 960 
atatcaggca gtgctataat cgattggaaa ctcttgttaa accccggcta tcgaatatac 1020 
gggccctctt ttgtttggtg cgtcttacaa ccctaccttt ggaacgtggc ctgactttat 1080 
tatactaagg cacttgtggc aatggagtat tttgatttcg caatttttct gactatcttt 1140 
gctcaagtct gcgagttgtc caggctcatt ggactccatt taacgacaac gaccccgcca 120C 
acggaagatg gggctgtggg cgaccagcct aaagacttgt tctggagcat cttcctcgtc 1260 
gatgttcgtc aaaaacttgc ttcgttttct gagcatttcg cttactatag gattcagaag 1320 
cacgtatcca tcattggggg caaggcctgc ctattgccct cgtatgactg cagcgtacca 1380 
ttgcctccat atgactccgc tgcgccacta ccaaatgctt ttgcggcacg catacgcttg 1440 
gcattcattc ttgaggagat atatctgggc ttatactcag caaaatccag caaaatggaa 150C 
cagagtcgcg tccgccgccg tatccgcaga attgctcgaa aacttagcca gtggcacgtg 1560 
caacatgagc atgtactgcg taccggagat ccgaataggc ctctcgaaga gtatatctgt 1620 
gcaacgcagt tgaggtttgc actctcgagc tgttgggtac ttctgcataa acgcatttgg 1680 
agccaggaaa ggggcgctgt ctgcctacaa cacgctcggg attgtctgat gctgttcaag 1740 
caattgtgcg atgggtgtaa atctggtttc agcaatttcg acaggtaagt cttccgtgcg 1800 
accctcttga cagcatccta acctggggat gtacttcata gcattgtcct gaactattct 1860 
ttgatctcat tcatgggaat ctatgtccac attgtggagg aagaccagcc gatccattca 192C 
caggacatgg agatactcac tttcttcgcc atatacacga accgttcggc atccaatagg 1980 
tcatctgcat ctatctcgta caaattaagc caagtggcca gtcgctgtag cgatattgcc 2040 



59 



-3NSDOCID: <WO 0037629 A2 I > 



WO 00/37629 



PCT/US99/29583 



ctcctcctcc 
accccctcat 
acaaccagca 
ggccaggtct 
actcct tctg 
t tccccgact 
actagccaaa 
ctaaatacac 



agaat ttaac 
ggaacgagcc 
ccggctct t c 
gggaca tata 
aggargcaac 
tctcacttgg 
gcgaat t egg 
cca 1 1 tea 



egagaggegt 
aacctaca tg 
ana taact eg 
cctcaacccg 
ccaggat 1 1 g 
cat cgacaac 
:cttattacc 



1 1 tattccga 
gat tacga tg 
aaiatcagcc 
agagaaa t ac 
c tgagcaa t g 
- tct ccgact 
gaggaggaca 



caacgata tc 
tcgccaatgc 
cget t ggtg t 
caa tgga tgg 
a tcctogcca 
t tccact tgg 
z aa z l cga z a 



acgaagtcca 
gt ccsctagc 
acccggagac 
cacaat tgcg 
ecgccc tggt 
ca t tgaca tg 
tgagagacta 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2478 



<210> 32 
<211> 1302 
<212> DNA 

<213> Aspergillus terreus 
<400> 32 

atggagtcta aagtccagac aaatgttcca 
cgtgggaagc gtgtatgtgt tccttcttgg 
ttctcttaat atatgtatag acgaaaggca 
gggcagttga aatctccatc acctaccctt 
ttcegcttta ccgtggccgc caactgacag 
tcgcgcccag ettaagegga gaaaccatga 
ctggtatgct gggtatggag ccaccttggt 
gtagegatec cccttataag cccgcgtgga 
acagaatttg cctccttcaa tatttatege 
tcaactggag cttccgtcct ggctgggttt 
gtaaccccag eggaggegat caagacgaaa 
gatcagttct taattcgttc tcttagcatt 
agtacgactt teggegegat agctggaatc 
tctgcggttg gtcctacaat tttgcggcag 
tataacgaac ttattgggct ggcccgaaaa 
ctggcaagca ecttggtegg ttctgttact 
ctggacgtga teaagacgeg gtaagtgctg 
tatatgetga cttggatgcg acttegggtt 
ctgtacggaa atacctttaa ctgcgtgaaa 
ttctggtccg gcgtctggtt teggacaggg 
cccgtgtaag tttaggagta atctacaggc 
aggtacgaga aagtctacaa gttcttgacg 



ttaccaaagg caccccttac ccaaaaagcc 60 

tcgcgcgtgt gccatgttac tgacaatgtc 120 

ttcctgeatt ggtcgcgggt gcttgtgctg 180 

tcgaatgtga gcttatcctg tgtttaagag 240 

tetattgett ccgctggtag eggctaaaac 300 

tgtggcagct ataaaacctg gaatccgagg 360 

aggaaccaca ttgaaagect ccgttcgtat 420 

gcgaaaagga atgaccgttt gcaataacaa 480 

tcggccctct cgggcccaaa tggagagctc 540 

ggggctggcg tgaccgaggc tgtcttagee 600 

atgtaagttg caacatctca cccgttatcc 660 

gatgeaagga aggttggaaa tgcagagtta 720 

cttcgagatc ggggaccget tggattcttc 780 

tegtccaatg cggcagtgaa gttcactgtt 840 

tactcgaaga aeggegaaga cgtgcaccct 900 

ggagtttgtt gcgcctggtc gacacagcca 960 

ctcagatcga cagtaacccg cccagataag 1020 

accagaatgc aatctcttca ggcaagacaa 1080 

acacttctgc gcaatgaagg cattggggtt 1140 

agactttccc ttacctcggc catcatgttt 1200 

atgatattct tgtacactga cagagegtea 1260 

cagccaaact ga 1302 



<210> 33 
<211> 1529 
<212> DNA 

<213> Aspergillus terreus 
<400> 33 

atgaccaagc aatctgcgga cagcaacgca aagtcaggag ttaeggcega aatatgecat 60 
tgggcatcca acctggccac tgacgacatc cctccggacg tattagaaag agcaaaatac 120 * 
cttattctcg atggtattgc atgtgcctgg gttggtgcaa gagtgccttg gtcagagaag 180 
tatgtgcagg caacaatgag etttgagecg ccaggggcct gcagggtgat tggatatgga 240 
caagttagtt ctatccaatc tgaacagtct acaaagtata ctgacgatcc tttgtataga 300 
aactggggee tgttgcagca gccatgacca attctgettt catacaggct aeggagcttg 360 
acgactacca cagcgaagcc cccctacact ctgeaagcat tgtcctccct gcggtctttg 420 
cagcaagtga ggtcttagee gagcagggca aaacaatttc tggtatagct gtcattctag 480 
ccgccattgt ggggtttgaa tctggcccgc ggatcggcaa agcaatctac ggateggace 540 
tcttgaacaa eggctggcat tgtggagccg tgtatggtgc tccagccggt gcgctggcca 600 
caggaaagct ccttggtctg actccagact ccatggaaga tgctctcgga ategegtgea 660 
cgcaagcctg tggcttaatg teggegcaat aeggaggcat ggtcaagege gtgcaacatg 720 
gattegcage gcgtaatggt cttcttgggg gactgttggc ccatggtggg tacgaggeca 780 
tgaagggtgt cctggagaga tettaeggeg gtttcctcaa aatgttcacc aagggcaatg 840 
geagagagee tccctacaaa gaggaggaag tggtggccgg teteggttea ttctggcata 900 
cctttactat tegcatcaag ctctatgcct getgeggact tgtccatggt ccagtcgaag 960 
ctatcgaaaa ccttcagagg aggtaccccg agctcttgaa tagagecaac ctcagcaaca 1020 
ttcgccacgt tcatgtacag ctttcaacag cctcgaacag tcactgtgga tggataccag 1080 
aggagagacc catcagttca ategcaggge agatgagtgt cgcatacatc ctcgccgtcc 1140 
agttggtcga ccagcaatgt cttctggccc agttttccga gtttgatgac aacttggaga 1200 



60 



NSDOCID: <WO O037629A2 I > 



WO 00/37629 



PCT/US99/29583 



aoccaaaaot grqcaarcnq gccaggaagg ttactccatc tcatagcgaa gagtttgatc 1260 



aagacggcaa ctgtctcact gcgggtcgcg 
ttacggaaac tgtcgagaac 



ttcacaaata ccgaaccctt 
aggaccttgt cctcagcccc gacaggctca 
attgtcccgt aaaatcqcca ctggcataa 



tgaggamga qzzcaacqaz gccccctctg 
cctcttggag zcaaacaccc catgccaaac gaaccgattc 
gctggtagcg rgacgcacga aacccgggtc aaagagattg 
gaca::ac cccactgctg gagctgcTita 



1320 
1360 
1440 
1500 
1529 



<210> 34 
<211> 1*704 
<212> DNA 

<213> Aspergillus terreus 
<400> 34 

atgggccgcg gtgacactga gtccccgaac ccagcgacga cctcggaagg tagcggacaa 60 
aacgagccag agaaaaaggg ccgtgatatt ccattatgga gaaaatgtgt carracgttc 120 
gttgttagtt ggatgactct agtcgttact ttctccagta cttgrcttct ncctgccgcc 180 
cctgaaatcg cgaatgaatt tgatatgact gtcgagacta tcaatatctc caatgctggt 240 
gtcttggtcg ccatgggata ttcatccctc atatggggtc ccatgaacaa gttagtcggc 300 
aggcggacat catacaatct ggccatttca atgctttgtg cgtgctccgc tggaacggca 360 
gcggcgataa acgagaaaat gttcatagcg ttcagagtat tgagcggctt aaccggaacc 420 
tcgttcatgg tctcaggcca aactgttctt gcagatatct ttgagcctgt acgcatcaca 480 
cgcccttgtc tccccaattg cgaaaactaa tccgttcgtg cgcaggttta ccgtgggacg 540 
gccgtaggtt tcttcatggc cgggactctt tctggccctg caataggtac gtaccctgct 600 
gaaagtacta gaactcccaa caggaactga ctgtttgatc aggcccctgc gtgggagggg 660 
tcatcgtcac tttcacgagt tggcgtgtta tcttctggct tcaactaggt atgagcggac 720 
tggggctcgt gctttccctg ctatttttcc cgaaaaccga aggaacttct gagaaggtct 780 
caacggcgtt taaaccgacc acacttgttt caatcatatc gaaattctcc ccaacggatg 840 
tgctcaagca gt.gggtgtat ccaaatgtct ttcttgccgt aagtgcctgg gagatatgcc 900 
ctctgcatct actggaaacg aaatgctcat gccgcaaaca aaaggactta tgctgtggcc 960 
tcctggcgat tacgcaatat tcgatcctga ctccagctcg tgctatattc aactcacggt 1020 
ttcatttaac gactgcccta gtatcgggtc tcttctacct cgctccaggt gccgggttcc 1080 
tgatagggag tctcgtcggc ggtaaacttt cggatcgcac cgttcggaga tacatagtaa 1140 
agcgcggatt ccgtctccct caggatcgac tccacagcgg gctcatcaca ttgtttgccg 1200 
tgctgcctgc gggaacgctc atttacgggt ggacactcca agaggataag ggtgggatgg 1260 
tagtgcccat aatcgcggcg ttcttcgcgg gctgggggct catgggcagt tttaactgcc 1320 
tgaacactta cgtggctggt ttgttccaca ccctcattaa tttatcccct ttgtgtacat 1380 
gcccacaata atgttgtctc tgaccgcaag tagaagcctt gccacggaac cggtctgcag 1440 
tcattgcagg aaagtatatg attcaatact ccttttctgc agggagtagt gcgctcgttg 1500 
tgcccgtcat agacgccctc ggagttggat ggacgttcac gctatgtatg gtacctttct 1560 
tcttacctct caatgtacat gctcacagtt gttgcaggtg tggttgcttc gactatagct 1620 
ggattgatca cggcggccat cgcacggtgg gggataaata tgcaaaggtg ggcagaaagg 1680 
gctttcaacc tgcctaccca atag 1704 



<210> 35 
<211> 1704 
<212> DNA 

<213> Aspergillus terreus 
<400> 35 

atgggccgcg gtgacactga gtccccgaac 
aacgagccag agaaaaaggg ccgtgatatt 
gttgttagtt ggatgactct agtcgttact 
cctgaaatcg cgaatgaatt tgatatgact 
gtcttggttg ccatgggata ttcatccctc 
aggcggacat catacaatct ggccatttca 
gcggcgataa acgagaaaat gttcatagcg 
tcgttcatgg tctcaggcca aactgttctt 
cgcccttgtc tccccaattg cgaaaactaa 
gccgtaggtt tcttcatggc cgggactctt 
gaaagtacta gaactcccaa caggaactga 
tcatcgtcac tttcacgagt tggcgtgtta 
tggggctcgt gctttccctg ctatttttcc 
caacggcgtt taaaccgacc acacttgttt 
tgctcaagca gtgggtgtat ccaaatgtct 



ccagcgacga cctcggaagg tagcggacaa 60 
ccattatgga gaaaatgtgt cattacgttt 120 
ttctccagta cttgtcttct tcctgccgcc 180 
gtcgagacta tcaatatctc caatgctggt 240 
atatggggtc ccatgaacaa gttagtcggc 300 
atgctttgtg cgtgctccgc tggaacggca 360 
ttcagagtat tgagcggctt aaccggaacc 420 
gcagatatct ttgagcctgt acgcatcaca 480 
tccgttcgtg cgcaggttta ccgtgggacg 540 
tctggccctg caataggtac gtaccctgct 600 
ctgtttgatc aggcccctgc gtgggagggg 660 
tcttctggct tcaactaggt atgagcggac 720 
cgaaaatcga aggaacttct gagaaggtct 780 
caatcatatc gaaattctcc ccaacggatg 840 
ttcttgccgt aagtgcctgg gagatatgcc 900 
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ctctgcatct 
tcctggcga t 
treat ttaac 
tgaragggag 
ageceggatt 
rgctgcctge 
t agtgcccar 
tgaacacr ta 
gcccacaata 
t cattgeagg 
tgcccgtcat 
tcttacctct 
ggatrgatca 
act t tcaacc 



actggaaacg 
r aegcaa t a r 
gactgcccra 
rctcgtcggc 
ccgtcrccct 
gggaacgetc 
aa t cgcggcg 
eg t ggcr ggt 
a tgt tgtctc 
aaagt a ta tg 
agacgccctc 
caa tg tacat 
cggcggcca t 
tgcctaccca 



aaargctcar 
tcga tccrga 
etaregggtc 
ggraaact 1 1 
caggatcgac 
ar t tacgggt 
t tct tegegg 
rtgttccaca 
tgaccgcaag 
attcaa tact 
ggagttggar 
gctcacagt t 
cgcacggtgg 
a tag 



gccgcaaaca 
ct tcagcrcg 
cctrctacci 
egga tegcac 
tccacagcgg 
ggacactcca 
gctgggggct 
ccctcat taa 
ragaagcert 
cert t tetge 
ggacgttcac 
gt tgcaggtg 
ggga taaa t a 



aaaggact ta 
tger a ta t tc 
cget ccaggr 
cgtt eggaga 
gctcatcaca 
agaggataag 
catgggcagr 
t tra tcccct 
gccacggaac 
agggagt agt 
gctatgtatg 
tggt tget tc 
tgcaaaggtg 



rgctgt ccec 
a a c r ca egg t 
geegggr rcc 
racatagt aa 
t tgtt tgccg 
ggtgggargg 
tt taactgee 
ttgtgtaca t 
eggtctgeag 
gcgctcct tg 
gtacct t tct 
gac ta t agct 
ggcagaaagg 



96C 

102C 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1704 



<210> 36 
<211> 1503 
<212> DNA 

<213> Aspergillus terreus 



<400> 36 

a tgaagcccg 

ctgaaeggea 

accaa tagt c 

gcccacgatc 

actaacgaga 

ttccttcaga 

gaatccta tt 

atagctggca 

ggcgtctcgc 

gcgctagatg 

aacgacgaag 

ggggaccctg 

aaagggaatt 

a-a tggcccac 

gttgggactc 

acaaacgagc 

gt taaagcat 

atgggca tgc 

agcattgt ta 

aacgtggttc 

tt tgctgact 

tgtaccagac 

cccattgcag 

gaacgcgccg 

tcaactgcgc 

tat 



caatcct tat 

agctcacat t 

caccggaca t 

tggccctgga 

ctcatccaac 

ggttgatagc 

catcagcact 

gtgacegteg 

cat tgttctg 

ttcagaaagt 

cgcccgcct t 

cct tcatctc 

acctctggcc 

tageggacta 

ectget tgaa 

aatcccagtt 

tta tgaggga 

ggggtctagg 

gctggcagga 

aaccat ttgt 

gecaggtctg 

caggtcacat 

aatgaaactg 

ccgcgctgtt 

geggea taca 



gaaatactgg 
gagtgagaca 
eta t a tcgac 
ct ttgggcgc 
ategatggee 
ggatcataag 
ggtgttgggt 
tggggecate 
gtggacggat 
ccagggtcca 
gcataactgg 
aegtt tctac 
ggcgatgtgg 
etaeggagtg 
agcccatgct 
tetaaaeggg 
gggtgtaatt 
egatgetgea 
atccg tgcta 
ccta tttgat 
tgacacagga 
tgatatat cc 
ggcgttctgg 
acaagtggat 
gccacggtgc 



ctcttcgtct 
aaggtgacgg 
cccgatgatt 
gtctttggta 
ateategctg 
etcgaegt ta 
ccagccaaag 
tatggcttat 
gttaccccaa 
ccgtcagtga 
attc'ttgeaa 
gcccatgtct 
tcaaatatgt 
gtaa tgggca 
gactacgaaa 
acgtgggact 
aggagecaac 
.tcgccgacac 
tcggacatcc 
gt taggatcc 
actgggaact 
tgatgacaat 
gggcgcagga 
caacacagct 
tcagacagta 



cagctgtgag 
gggccgttca 
eggtctcagt 
aaaatgecac 
gtaccataga 
ccagcatccg 
gca tacagaa 
acgatatatc 
ccaaacttga 
agtatcgtgg 
at tatggega 
tcgagctgat 
tttatgttga 
ctagccacac 
aagaaccgat 
gga ttagcaa 
actgggagac 
ttaaegcaac 
tgaataaaac 
attcaccct c 
tacta tgaga 
gcaggcaata 
att tactatc 
caactgatca 
tgggttgcca 



cgcgtcaacc 
gctggct tgt 
ggt tegegea 
agt teger tc 
taagt caacc 
tggccagrgg 
tgcgctagtc 
tgaacaaat t 
tgccatctac 
aatttttatc 
ggttgagaac 
cctgcgcctg 
tgacaccaac 
tggta tgacg 
ggctcgagca 
cgaggtcaat 
cgcatacaca 
agtggaagaa 
caacctgtcg 
aaata tarcg 
geggcargae 
tgetgegtet 
at tttgacat 
ggacctggga 
atattgggga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1503 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. | [ Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. | | Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 

1 ' I Y I As a " rec ! uirecl additional search fees were timely paid by the applicant, this International Search Report covers all 
L - A - J searchable claims. 

As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
1 1 covers only those claims for which fees were paid, specifically claims Nos.: 



□ 

No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest I The additional search fees were accompanied by the applicant's protest. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-11, 16-22, 24-29 (complete) and 23 (partially) 

The D4B gene cluster from Aspergillus terreus comprising the 
0RF1, 0RF2, lovA, lovB, 0RF5, LovC, lovD, HMG CoA reductase, 
LovE, ORF10 and part of the lovFA genes involved in the 
biosynthesis of lovastatin. Uses thereof in a method for 
increasing the production of lovastatin in a 
lovastatin-producing organism, for increasing the production 
of monacolin J in a lovastatin producing organism, and for 
increasing the production of monacolin J in a 
non-1 ovastat in-producing organism; fragments of the D4B gene 
cluster comprising the gene encoding for the esterase-1 ike 
gene (0RF1, SEQ ID N0:20), the gene encoding 0RF2 (SEQ ID 
N0:21), the lovA gene (SEQ ID N0:22), the gene encoding 0RF5 
(SEQ ID NO: 23), the lovC gene (SEQ ID NO: 24), the lovD gene 
(SEQ ID N0:25), the gene coding for the HMG CoA reductase 
(SEQ ID N0:26), the lovE gene (SEQ ID N0:27), the gene 
encoding 0RF10 (SEQ ID N0:28) and the lovB gene (SEQ ID 
N0:29); a lovastatin-producing organism genetically modified 
to increase lovastatin production and a 
non-lovastatin-producing organism genetically modified to 
produce monacolin J or to produce lovastatin. 



2. Claims: 12-15 (complete) 

A method of increasing the production of lovastatin in a 
lovastatin producing organism comprising the step of 
transforming an organism with the LovE gene from A. terreus. 



3. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:30) encoding the 0RF12 polypeptide (SEQ ID N0:11). 



4. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:31) encoding the zinc finger polypeptide of SEQ ID N0:12. 



5. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 

NO: 32) encoding the acetyl -CoA transport polypeptide of SEQ 
ID N0:13. 



6. Claim : 23 (partially) 
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An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:33) encoding the 0RF15 polypeptide (SEQ ID NO: 14). 



7. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:34) encoding the membrane transport polypeptide of SEQ ID 
N0:15. 



8. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
N0:35) encoding the P450 polypeptide of SEQ ID N0:16. 



9. Claim : 23 (partially) 

An isolated nucleic acid from Aspergillus terreus (SEQ ID 
NO:36) encoding the 0RF18 polypeptide (SEQ ID NO; 17). 
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